Abstract

Let $\mathcal{F}$ be a finite set of graphs. In the $\mathcal{F}$-Deletion problem, we are given an $n$-vertex,
$m$-edge graph $G$ and an integer $k$ as input, and asked whether at most $k$ vertices can be
deleted from $G$ such that the resulting graph does not contain a graph from $\mathcal{F}$ as a minor.
$\mathcal{F}$-Deletion is a generic problem and by selecting different sets of forbidden minors
$\mathcal{F}$, one can obtain various fundamental problems such as Vertex Cover, Feedback
Vertex Set or Treewidth $\eta$-Deletion.

In this paper we obtain a number of generic algorithmic results about $\mathcal{F}$-Deletion,
when $\mathcal{F}$ contains at least one planar graph. The highlights of our work are

• A randomized $O(nm)$ time constant factor approximation algorithm for the
optimization version of $\mathcal{F}$-Deletion.

• A randomized $O(2^{O(k)} n)$ parameterized algorithm for $\mathcal{F}$-Deletion when $\mathcal{F}$ is
connected. Here a family $\mathcal{F}$ is called connected if every graph in $\mathcal{F}$ is connected. The
algorithm can be made deterministic at the cost of making the polynomial factor
in the running time $n \log^2 n$ rather than linear.

These algorithms unify, generalize, and improve over a multitude of results in the
literature. Our main results have several direct applications, and the methods we develop
on the way are also applicable beyond the scope of this paper. Our results - constant
factor approximation and FPT algorithms - are tied together by a common theme
of polynomial time preprocessing.

1 Introduction

Let $\mathcal{G}$ be the set of all finite undirected graphs and let $\mathcal{L}$ be the family of all finite subsets of
$\mathcal{G}$. Thus every element $\mathcal{F} \in \mathcal{L}$ is a finite set of graphs and throughout the paper we assume
that $\mathcal{F}$ is explicitly given. In this paper we study the following p-$\mathcal{F}$-Deletion problem.



p-$\mathcal{F}$-Deletion          Parameter: $k$

Input: A graph $G$ and a non-negative integer $k$.

Question: Does there exist $S \subseteq V(G)$, $|S| \le k$, such that $G \setminus S$ contains no graph from
$\mathcal{F}$ as a minor?

The p-$\mathcal{F}$-Deletion problem defines a wide subclass of node (or vertex) removal problems
studied since the 1970s. By the classical theorem of Lewis and Yannakakis [33], deciding
whether removing at most $k$ vertices results in a subgraph with property $\pi$ is NP-complete for
every non-trivial property $\pi$. By a celebrated result of Robertson and Seymour, every
p-$\mathcal{F}$-Deletion problem is non-uniformly fixed-parameter tractable (FPT). That is, for every $k$
there is an algorithm solving the problem in time $O(f(k) \cdot n^3)$ [H]. The importance of the
result comes from the fact that it simultaneously gives FPT algorithms for a variety of
important problems such as Vertex Cover, Feedback Vertex Set, Vertex Planarization,
etc. It is conceivable that meta-theorems for vertex deletion problems might be formulated
by addressing problems that are expressible in logics such as first order and monadic second
order. However, since these capture problems that are known to be intractable, for example
Independent Set or Dominating Set, we do not expect to have a theorem that guarantees
tractability for vertex deletion problems through this route. Therefore, the systematic study
of the p-$\mathcal{F}$-Deletion problems is the more promising way forward to obtain meta-theorems
for vertex removal problems on general undirected graphs.

In this paper we show that when $\mathcal{F} \in \mathcal{L}$ contains at least one planar graph, it is
possible to obtain a number of generic results advancing known tractability borders of
p-$\mathcal{F}$-Deletion. The case when $\mathcal{F}$ contains a planar graph, while being considerably more
restricted than the general case, already encompasses a number of the well-studied instances
of p-$\mathcal{F}$-Deletion. For example, when $\mathcal{F} = \{K_2\}$, a complete graph on two vertices, this
is the Vertex Cover problem. When $\mathcal{F} = \{C_3\}$, a cycle on three vertices, this is the
Feedback Vertex Set problem. Another fundamental problem, which is a special case
of p-$\mathcal{F}$-Deletion, is Treewidth $\eta$-Deletion or $\eta$-Transversal, which is to delete at
most $k$ vertices to obtain a graph of treewidth at most $\eta$. Since any graph of treewidth $\eta$
excludes a $(\eta + 1) \times (\eta + 1)$ grid as a minor, we have that the set $\mathcal{F}$ of forbidden minors
of treewidth $\eta$ graphs contains a planar graph. Treewidth $\eta$-Deletion plays an important
role in generic efficient polynomial time approximation schemes based on Bidimensionality
Theory [25, 26]. Among other examples of p-$\mathcal{F}$-Deletion that can be found in the literature
on approximation and parameterized algorithms are the cases of $\mathcal{F}$ being $\{K_{2,3}, K_4\}$, $\{K_4\}$,
$\{\theta_c\}$, and $\{K_3, T_2\}$, which correspond to removing vertices to obtain an outerplanar graph,
a series-parallel graph, a diamond graph, and a graph of pathwidth one, respectively.

We call a family $\mathcal{F} \in \mathcal{L}$ connected if every graph in $\mathcal{F}$ is connected. The main algorithmic
contributions of our work are the following results for p-$\mathcal{F}$-Deletion for the case that
$\mathcal{F}$ contains a planar graph:

• A randomized $O(nm)$ time constant factor approximation algorithm for the
optimization version of $\mathcal{F}$-Deletion.

• A randomized linear time and single exponential parameterized algorithm for
p-$\mathcal{F}$-Deletion when $\mathcal{F}$ is connected. That is, an algorithm running in time $O(2^{O(k)} n)$.
The algorithm can be made deterministic at the cost of making the running time
$O(2^{O(k)} n \log^2 n)$ rather than $O(2^{O(k)} n)$.

We use $\mathcal{P}$ to denote the subclass of $\mathcal{L}$ such that every $\mathcal{F} \in \mathcal{P}$ contains a planar graph. Let
us remark that for most interesting minor closed graph classes, the set $\mathcal{F}$ of forbidden minors
is connected. Specifically, if a graph class $\Pi$ has the property that a graph $G$ is in $\Pi$ whenever
all connected components of $G$ are, then the set of forbidden minors of $\Pi$ is connected.

Methodology. All our results - constant factor approximation and FPT algorithms for
p-$\mathcal{F}$-Deletion - have a common theme of polynomial time preprocessing. Preprocessing as
a strategy for coping with hard problems is universally applied in practice and the notion of
kernelization in parameterized complexity provides a mathematical framework for analyzing
the quality of preprocessing strategies. In parameterized complexity each problem instance
comes with a parameter $k$ and a central notion in parameterized complexity is fixed
parameter tractability (FPT). This means, for a given instance $(x, k)$, solvability in time
$f(k) \cdot p(|x|)$, where $f$ is an arbitrary function of $k$ and $p$ is a polynomial in the input size.
The parameterized problem is said to admit a polynomial kernel if there is a polynomial time
algorithm (the degree of the polynomial is independent of $k$), called a kernelization algorithm,
that reduces the input instance down to an instance with size bounded by a polynomial $p(k)$
in $k$, while preserving the answer.

Figure 1: General view of our approach

Thus the goal of kernelization is to apply reduction rules such that the size of the reduced
instance can be upper bounded by a function of the parameter. However, if we want to use
preprocessing for approximation or FPT algorithms, it is not necessary that the size of the
reduced instance be upper bounded. What we need is a preprocessing procedure that
allows us to navigate the solution search space efficiently. Our first contribution is a notion
of preprocessing that is geared towards approximation and FPT algorithms. This notion
relaxes the demands of kernelization and thus it is possible that a larger set of problems may
admit this simplification procedure, when compared to kernelization. For approximation
and FPT algorithms, we use the notion of $\alpha$-cover as a measure of good preprocessing. For
$0 < \alpha \le 1$, we say that a vertex subset $S \subseteq V(G)$ is an $\alpha$-cover, if the sum of vertex degrees
$\sum_{v \in S} d(v)$ is at least $2\alpha|E(G)|$. For example, every vertex cover of a graph is a
$\frac{1}{2}$-cover, since every edge contributes at least one to the degree sum.
The defining property of this preprocessing is that the equivalent simplified instance of the
problem admits some optimal solution which is also an $\alpha$-cover. If we succeed with this goal,
then for an edge selected uniformly at random, with a constant probability at least one of
its endpoints belongs to some optimal solution. Using this as a basic step, we can construct
approximation and FPT algorithms. But how to achieve this kind of preprocessing?
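
As a small illustration of the $\alpha$-cover notion, the following Python sketch checks the degree-sum condition and computes the fraction of edges with an endpoint in $S$. The function names and the edge-list representation are assumptions made for this example and do not come from the paper. If $S$ is an $\alpha$-cover, the degree sum counts each edge at most twice, so at least an $\alpha$ fraction of the edges touch $S$.

```python
from fractions import Fraction


def degree_sum(edges, S):
    """Sum of d(v) over v in S, for a simple graph given as a list of edges (u, v)."""
    S = set(S)
    return sum((u in S) + (v in S) for u, v in edges)


def is_alpha_cover(edges, S, alpha):
    """True if sum_{v in S} d(v) >= 2 * alpha * |E(G)|."""
    return degree_sum(edges, S) >= 2 * Fraction(alpha) * len(edges)


def hit_probability(edges, S):
    """Probability that a uniformly random edge has at least one endpoint in S."""
    S = set(S)
    hits = sum(1 for u, v in edges if u in S or v in S)
    return Fraction(hits, len(edges))


# Path 0-1-2-3 with the vertex cover {1, 2}.
path = [(0, 1), (1, 2), (2, 3)]
cover = {1, 2}
print(is_alpha_cover(path, cover, Fraction(1, 2)))
print(hit_probability(path, cover))
```

Exact rational arithmetic is used here only to avoid floating-point comparisons at the threshold.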

To achieve our goals we use the idea of graph replacement dating back to Fellows and
Langston [21]. Precisely, what we use is the modern notion of "protrusion reduction" that has
been recently employed in [7J E7J for obtaining meta-kernelization theorems for problems on
sparse graphs like planar graphs, graphs of bounded genus [8], graphs excluding a fixed graph
as a minor or induced subgraph [271 E3], or graphs excluding a fixed graph as a topological
minor [32]. In this method, we find a large protrusion - a graph of small treewidth and
small boundary - and then the preprocessing rule replaces this protrusion by a protrusion
of constant size. One repeatedly applies this rule until it is no longer possible. Finally, by using
combinatorial arguments one upper bounds the size of the reduced instance (a graph without
large protrusions). The FPT algorithms use the replacement technique developed in [8] [23],
while for the approximation algorithm we need another type of protrusion reduction. The reason
why the normal protrusion replacement does not work for approximation algorithms is the
same as why an NP-hardness reduction is not always an approximation preserving reduction.
While the normal protrusion replacement works fine for preserving exact solutions, we need
a notion of protrusion reduction that also preserves approximate solutions. To this end,
we develop a new notion of lossless protrusion reduction, and show that several problems
do admit lossless protrusion reductions. We exemplify the usefulness of the new concept
by obtaining constant factor approximation algorithms for $\mathcal{F}$-Deletion. These FPT and
approximation algorithms are obtained by showing that solutions to the instances of the
problem that do not contain protrusions form an $\alpha$-cover for some fixed constant $\alpha$.

Now that we are equipped with the new tools and concepts, $\alpha$-cover and lossless protrusion
reduction, we are able to proceed with algorithms for p-$\mathcal{F}$-Deletion. These algorithms
unify and generalize a multitude of results in the literature. In what follows we survey earlier
results in each direction and discuss our results.

Approximation. In the optimization version of p-$\mathcal{F}$-Deletion, we want to compute a
minimum set $S$ whose removal leaves the input graph $G$ $\mathcal{F}$-minor-free. We denote this
optimization problem by $\mathcal{F}$-Deletion. Characterising graph properties for which the corresponding
vertex deletion problem can be approximated within a constant factor is a long standing open
problem in approximation algorithms [43]. In spite of a long history of research, we are still
far from a complete understanding. Constant factor approximation algorithms for Vertex
Cover have been known since the 1970s [36, 2]. Lund and Yannakakis observed that the vertex deletion
problem for any hereditary property with a finite number of minimal forbidden subgraphs can
be approximated with a constant ratio [3l]. They also conjectured that for every nontrivial,
hereditary property with an infinite number of minimal forbidden subgraphs, the vertex
deletion problem cannot be approximated with constant ratio. However, it appeared later that
Feedback Vertex Set admits a constant factor approximation [3],[T] and thus the dividing
line of approximability lies somewhere else. On a related matter, Yannakakis [42] showed
that approximating the number of vertices to delete in order to obtain a connected graph with
some property $\pi$ within factor $n^{1-\varepsilon}$ is NP-hard, see [42] for the definition of the property $\pi$.
This result holds for a very wide class of properties, in particular for the properties of being acyclic
and outerplanar. There was not much progress on approximability/non-approximability of
vertex deletion problems until the recent work of Fiorini et al. [22], who gave a constant factor
approximation algorithm for p-$\mathcal{F}$-Deletion for the case when $\mathcal{F}$ consists of a diamond graph, i.e., a
graph with two vertices and three parallel edges.

Our first contribution is the theorem stating that for every graph property $\pi$ expressible by
a finite set of forbidden minors containing at least one planar graph, the vertex deletion
problem for property $\pi$ admits a constant factor approximation algorithm. In other words,
we prove the following theorem.

Theorem 1. For every set $\mathcal{F} \in \mathcal{P}$, $\mathcal{F}$-Deletion admits a randomized (Monte Carlo)
constant ratio approximation algorithm with running time $O(nm)$.

Let us remark that for all known constant factor approximation algorithms of vertex
deletion to a hereditary property $\pi$, property $\pi$ is either characterized by a finite number of
minimal forbidden subgraphs or by a finite number of forbidden minors, one of which is planar.
Theorem 1, together with the result of Lund and Yannakakis, not only encompasses all known
vertex deletion problems with constant factor approximation ratio but significantly extends
known tractability borders for such types of problems.

Fast FPT Algorithms. The study of parameterized problems proceeds in several steps.
The first step is to establish whether the problem at hand is fixed parameter tractable or not. If
the problem is in FPT, then the next steps are to identify whether the problem admits a polynomial
kernel and to find the fastest possible FPT algorithm solving the problem. The running time
of every FPT algorithm is $O(f(k) n^c)$, that is, the product of a super-polynomial function
$f(k)$ depending only on the parameter $k$ and a polynomial $n^c$, where $n$ is the input size and
$c$ is some constant. Both steps, minimizing the super-polynomial function $f(k)$ and minimizing
the exponent $c$ of the polynomial part, are important parts in the design and analysis of
parameterized algorithms.

The p-$\mathcal{F}$-Deletion problem was introduced by Fellows and Langston [20], who gave a
non-constructive algorithm running in time $O(f(k) \cdot n^2)$ for some function $f(k)$ [20, Theorem
6]. This result was improved by Bodlaender [5] to $O(f(k) \cdot n)$, for $f(k) = 2^{2^{O(k \log k)}}$. There is
a substantial amount of work on improving the exponential function $f(k)$ for special cases of
p-$\mathcal{F}$-Deletion. For the Vertex Cover problem the existence of single-exponential
algorithms has been well known almost since the beginnings of the field of Parameterized Complexity,
the current best algorithm being by Chen et al. [14]. A randomized parameterized single-exponential
algorithm for Feedback Vertex Set was given by Becker et al. [I], but the existence
of deterministic single-exponential algorithms for Feedback Vertex Set was open for a
while, and it took some time and the discovery of iterative compression [39] to reduce the running
time to $2^{O(k)} n^{O(1)}$ pH 031 QH Q3H EH1 EH]. The current champions for Feedback Vertex
Set are the deterministic algorithm of Cao et al. [11] with running time $O(3.83^k k n^2)$ and
the randomized algorithm of Cygan et al. with running time $3^k n^{O(1)}$ [17]. Recently, Joret et
al. [30] showed that p-$\mathcal{F}$-Deletion for $\mathcal{F} = \{\theta_c\}$, where $\theta_c$ is the graph with two vertices
and $c$ parallel edges, can be solved in time $2^{O(k)} n^{O(1)}$ for every fixed $c$. Philip et al. [37]
studied Pathwidth 1-Deletion and obtained an algorithm with running time $O(7^k n^2)$
that was later improved to $O(4.65^k n^{O(1)})$ [18]. Kim et al. [31] gave a single exponential
algorithm for $\mathcal{F} = \{K_4\}$. Unless the Exponential Time Hypothesis (ETH) fails [12, 29], single
exponential dependence on the parameter $k$ is asymptotically the best bound one can obtain
for p-$\mathcal{F}$-Deletion, and thus our next theorem provides asymptotically optimal bounds on
the exponential function of the parameter and the polynomial contribution of the input.

Theorem 2. For every connected set $\mathcal{F} \in \mathcal{P}$ containing a planar graph, there is a randomized
(Monte Carlo) algorithm solving p-$\mathcal{F}$-Deletion in time $O(c^k n)$ for some constant $c > 1$.

We finally give a deterministic algorithm for p-$\mathcal{F}$-Deletion. Surprisingly, our algorithm
does not use iterative compression but is based on branching on degree sequences.

Theorem 3. For every connected set $\mathcal{F} \in \mathcal{P}$ containing a planar graph, p-$\mathcal{F}$-Deletion is
solvable in time $O(c^k n \log^2 n)$ for some constant $c > 1$.

2 Preliminaries 

In this section we give various definitions which we use in the paper. We use $V(G)$ to denote
the vertex set of a graph $G$, and $E(G)$ to denote the edge set. The degree of a vertex $v$ in $G$
is the number of edges incident on $v$, and is denoted by $d(v)$. A graph $G'$ is a subgraph of $G$
if $V(G') \subseteq V(G)$ and $E(G') \subseteq E(G)$. The subgraph $G'$ is called an induced subgraph of $G$ if
$E(G') = \{\{u, v\} \in E(G) \mid u, v \in V(G')\}$. Given a subset $S \subseteq V(G)$ the subgraph induced by
$S$ is denoted by $G[S]$. The subgraph induced by $V(G) \setminus S$ is denoted by $G \setminus S$. We denote
by $N_G(S)$ the open neighborhood of $S$, i.e. the set of vertices in $V(G) \setminus S$ adjacent to $S$.
Whenever the graph $G$ is clear from the context, we omit the subscript in $N_G(S)$ and denote
it only by $N(S)$. By $N[S]$ we denote $N(S) \cup S$. Let $\mathcal{F}$ be a finite set of graphs. A vertex
subset $S \subseteq V(G)$ of a graph $G$ is said to be an $\mathcal{F}$-deletion set if $G \setminus S$ does not contain any
graphs in the family $\mathcal{F}$ as a minor.
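
The set operations defined above translate directly into code. The sketch below is only an illustration under an assumed representation (a dictionary mapping each vertex to the set of its neighbours); the function names are hypothetical and are not taken from the paper.

```python
def open_neighborhood(adj, S):
    """N(S): vertices outside S that are adjacent to some vertex of S."""
    S = set(S)
    return {w for v in S for w in adj[v]} - S


def closed_neighborhood(adj, S):
    """N[S] = N(S) together with S itself."""
    return open_neighborhood(adj, S) | set(S)


def induced_subgraph(adj, S):
    """G[S]: keep the vertices of S and the edges with both endpoints in S."""
    S = set(S)
    return {v: adj[v] & S for v in S}


def delete_vertices(adj, S):
    """G \\ S: the subgraph induced by V(G) minus S."""
    return induced_subgraph(adj, set(adj) - set(S))


# Path 0-1-2-3.
adj = {0: {1}, 1: {0, 2}, 2: {1, 3}, 3: {2}}
print(open_neighborhood(adj, {1}))
print(delete_vertices(adj, {1}))
```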

2.1 Parameterized algorithms and kernels. 

A parameterized problem $\Pi$ is a subset of $\Gamma^* \times \mathbb{N}$ for some finite alphabet $\Gamma$. An instance
of a parameterized problem consists of $(x, k)$, where $k$ is called the parameter. We assume
that $k$ is given in unary and hence $k \le |x|$. A central notion in parameterized complexity
is fixed parameter tractability (FPT) which means, for a given instance $(x, k)$, solvability in
time $f(k) \cdot p(|x|)$, where $f$ is an arbitrary function of $k$ and $p$ is a polynomial in the input
size. The notion of kernelization is formally defined as follows.

Definition 1. [Kernelization] Let $\Pi \subseteq \Gamma^* \times \mathbb{N}$ be a parameterized problem and $g$ be a
computable function. We say that $\Pi$ admits a kernel of size $g$ if there exists an algorithm
$K$, called a kernelization algorithm, or, in short, a kernelization, that given $(x, k) \in \Gamma^* \times \mathbb{N}$,
outputs, in time polynomial in $|x| + k$, a pair $(x', k') \in \Gamma^* \times \mathbb{N}$ such that

(a) $(x, k) \in \Pi$ if and only if $(x', k') \in \Pi$, and

(b) $\max\{|x'|, k'\} \le g(k)$.

When $g(k) = k^{O(1)}$ or $g(k) = O(k)$ then we say that $\Pi$ admits a polynomial or linear kernel
respectively. If additionally $k' \le k$ we say that the kernel is strict.
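
To make Definition 1 concrete, here is a minimal sketch of a well-known kernelization for Vertex Cover (the case $\mathcal{F} = \{K_2\}$), usually attributed to Buss. It is not taken from this paper and is shown only as an example of a strict polynomial kernel. The function name `buss_kernel` and the convention of returning `None` for a detected "no" instance are assumptions of this sketch; a formal kernelization would output a fixed trivial "no" instance instead.

```python
def buss_kernel(edges, k):
    """Reduce a Vertex Cover instance (edges, k).

    Returns (reduced_edges, k') with k' <= k and at most k'^2 edges,
    or None when the instance is recognised as a "no" instance.
    """
    edges = {frozenset(e) for e in edges}
    while True:
        if k < 0:
            return None
        degree = {}
        for e in edges:
            for v in e:
                degree[v] = degree.get(v, 0) + 1
        # A vertex of degree > k must belong to every vertex cover of size <= k.
        high = [v for v, d in degree.items() if d > k]
        if not high:
            break
        v = high[0]
        edges = {e for e in edges if v not in e}
        k -= 1
    # Now every vertex covers at most k edges, so k vertices cover at most k^2 edges.
    if len(edges) > k * k:
        return None
    return edges, k


star = [(0, i) for i in range(1, 6)]
print(buss_kernel(star, 1))
```

The reduced instance has at most $k^2$ edges and hence at most $2k^2$ non-isolated vertices, so $g(k) = O(k^2)$ in the sense of the definition.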

2.2 Treewidth. 

Let $G$ be a graph. A tree decomposition of $G$ is a pair $(T, \mathcal{X} = \{X_t\}_{t \in V(T)})$ where $T$ is a tree
and $\mathcal{X}$ is a collection of subsets of $V(G)$ such that:

• $\forall e = uv \in E(G), \exists t \in V(T) : \{u, v\} \subseteq X_t$ and

• $\forall v \in V(G)$, $T[\{t \mid v \in X_t\}]$ is a non-empty connected subtree of $T$.

We call the vertices of $T$ nodes and the sets in $\mathcal{X}$ bags of the tree decomposition $(T, \mathcal{X})$. The
width of $(T, \mathcal{X})$ is equal to $\max\{|X_t| - 1 \mid t \in V(T)\}$ and the treewidth of $G$ is the minimum
width over all tree decompositions of $G$.
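
The two conditions of a tree decomposition can be verified mechanically. The following sketch is an illustration only; the representation (bags as a dictionary from tree nodes to vertex sets, the tree as an edge list) and the function names are assumptions of this example.

```python
def is_tree_decomposition(vertices, edges, tree_edges, bags):
    """Check the two defining conditions for (T, X), where bags maps node -> set."""
    nodes = set(bags)
    tree_adj = {t: set() for t in nodes}
    for s, t in tree_edges:
        tree_adj[s].add(t)
        tree_adj[t].add(s)

    def connected(subset):
        """True if subset is non-empty and induces a connected subgraph of T."""
        subset = set(subset)
        if not subset:
            return False
        start = next(iter(subset))
        seen, stack = {start}, [start]
        while stack:
            t = stack.pop()
            for s in tree_adj[t] & subset:
                if s not in seen:
                    seen.add(s)
                    stack.append(s)
        return seen == subset

    # T itself must be a tree: connected with |V(T)| - 1 edges.
    if len(tree_edges) != len(nodes) - 1 or not connected(nodes):
        return False
    # Every edge of G must be contained in some bag.
    for u, v in edges:
        if not any({u, v} <= bags[t] for t in nodes):
            return False
    # The nodes whose bags contain v must form a non-empty connected subtree.
    return all(connected({t for t in nodes if v in bags[t]}) for v in vertices)


def width(bags):
    """Width of the decomposition: largest bag size minus one."""
    return max(len(bag) for bag in bags.values()) - 1


# Path 0-1-2 with a two-node decomposition of width 1.
good = {"x": {0, 1}, "y": {1, 2}}
print(is_tree_decomposition({0, 1, 2}, [(0, 1), (1, 2)], [("x", "y")], good), width(good))
```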

A nice tree decomposition is a pair $(T, \mathcal{X})$ where $(T, \mathcal{X})$ is a tree decomposition such that
$T$ is a rooted tree and the following conditions are satisfied:

• Every node of the tree $T$ has at most two children;

• if a node $t$ has two children $t_1$ and $t_2$, then $X_t = X_{t_1} = X_{t_2}$; and

• if a node $t$ has one child $t_1$, then either $|X_t| = |X_{t_1}| + 1$ and $X_{t_1} \subset X_t$ (in this case
we call $t_1$ an insert node) or $|X_t| = |X_{t_1}| - 1$ and $X_t \subset X_{t_1}$ (in this case we call $t_1$ a forget
node).

It is possible to transform a given tree decomposition $(T, \mathcal{X})$ into a nice tree decomposition
$(T', \mathcal{X}')$ in time $O(|V| + |E|)$.
2.3 Minors



Given an edge $e = xy$ of a graph $G$, the graph $G/e$ is obtained from $G$ by contracting the
edge $e$, that is, the endpoints $x$ and $y$ are replaced by a new vertex $v_{xy}$ which is adjacent to
the old neighbors of $x$ and $y$ (except for $x$ and $y$). A graph $H$ obtained by a sequence of
edge-contractions is said to be a contraction of $G$. We denote it by $H \le_c G$. A graph $H$
is a minor of a graph $G$ if $H$ is the contraction of some subgraph of $G$ and we denote it by
$H \le_m G$. We say that a graph $G$ is $H$-minor-free when it does not contain $H$ as a minor.
We also say that a graph class $\mathcal{G}$ is $H$-minor-free (or, excludes $H$ as a minor) when all its
members are $H$-minor-free. It is well-known [JO] that if $H \le_m G$ then $tw(H) \le tw(G)$. We
will also use the following fact about excluding planar graphs as minors.
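
Edge contraction on simple graphs can be sketched in a few lines. The adjacency-dictionary representation and the label `("v", x, y)` for the new vertex $v_{xy}$ are assumptions of this illustration, not notation from the paper.

```python
def contract_edge(adj, x, y):
    """Return the adjacency dictionary of G/e for the edge e = xy (simple graphs)."""
    if y not in adj[x]:
        raise ValueError("xy is not an edge of the graph")
    new = ("v", x, y)  # stands for the new vertex v_xy
    # v_xy is adjacent to the old neighbours of x and y, except x and y themselves.
    merged_neighbours = (adj[x] | adj[y]) - {x, y}
    result = {}
    for v, neighbours in adj.items():
        if v in (x, y):
            continue
        neighbours = set(neighbours)
        if neighbours & {x, y}:
            neighbours = (neighbours - {x, y}) | {new}
        result[v] = neighbours
    result[new] = merged_neighbours
    return result


# Contracting one edge of the 4-cycle 0-1-2-3-0 yields a triangle,
# so K_3 is a contraction (and hence a minor) of C_4.
c4 = {0: {1, 3}, 1: {0, 2}, 2: {1, 3}, 3: {0, 2}}
triangle = contract_edge(c4, 0, 1)
print(sorted(len(n) for n in triangle.values()))
```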

Proposition 1. There is a constant $c$ such that for every planar $H$ and graph $G$ with
$tw(G) > 2^{c|V(H)|^{3}}$, $H$ is a minor of $G$.

2.4 t-Boundaried graphs and Gluing. 

A $t$-boundaried graph is a graph $G$ and a set $B \subseteq V(G)$ of size at most $t$ with each vertex
$v \in B$ having a label $\ell_G(v) \in \{1, \ldots, t\}$. Each vertex in $B$ has a unique label. We refer to
$B$ as the boundary of $G$. For a $t$-boundaried graph $G$ the function $\delta(G)$ returns the boundary of
$G$. Two $t$-boundaried graphs $G$ and $H$ are isomorphic if there is a bijection $f$ from $V(G)$ to
$V(H)$ such that $uv \in E(G) \iff f(u)f(v) \in E(H)$, and for every $v \in \delta(G)$ we have $f(v) \in \delta(H)$
and $\ell_G(v) = \ell_H(f(v))$. Specifically $f$ is an isomorphism between $G$ and $H$ in the normal
graph sense, but additionally $f$ respects the labels of the border vertices. Observe that a
$t$-boundaried graph may have no boundary at all. A graph $G$ is isomorphic to a $t$-boundaried
graph $H$ if there is an isomorphism between $G$ and $H$.

Two $t$-boundaried graphs $G_1$ and $G_2$ can be glued together to form a graph $G = G_1 \oplus G_2$.
The gluing operation takes the disjoint union of $G_1$ and $G_2$ and identifies the vertices of
$\delta(G_1)$ and $\delta(G_2)$ with the same label. If there are vertices $u_1, v_1 \in \delta(G_1)$ and $u_2, v_2 \in \delta(G_2)$
such that $\ell_{G_1}(u_1) = \ell_{G_2}(u_2)$ and $\ell_{G_1}(v_1) = \ell_{G_2}(v_2)$ then $G$ has vertices $u$ formed by unifying
$u_1$ and $u_2$ and $v$ formed by unifying $v_1$ and $v_2$. The new vertices $u$ and $v$ are adjacent if
$u_1 v_1 \in E(G_1)$ or $u_2 v_2 \in E(G_2)$.
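
A minimal sketch of the gluing operation, assuming each boundaried graph is given as a triple `(vertices, edges, labels)` where `labels` maps boundary vertices to their labels. This representation and the renaming scheme (boundary vertices become `("b", label)`, all other vertices are tagged with the index of their graph) are choices made for this example only.

```python
def glue(g1, g2):
    """Return (vertices, edges) of the glued graph G1 (+) G2."""

    def renaming(tag, g):
        vertices, _, labels = g
        # Boundary vertices are named by their label, so equal labels are identified;
        # non-boundary vertices are tagged to keep the union disjoint.
        return {v: ("b", labels[v]) if v in labels else (tag, v) for v in vertices}

    vertices, edges = set(), set()
    for tag, g in ((1, g1), (2, g2)):
        name = renaming(tag, g)
        vertices |= set(name.values())
        # Storing edges as frozensets merges parallel copies of the same edge.
        edges |= {frozenset((name[u], name[v])) for u, v in g[1]}
    return vertices, edges


# p, q and x, y are boundary vertices with labels 1 and 2 respectively.
g1 = ({"a", "p", "q"}, [("a", "p"), ("a", "q")], {"p": 1, "q": 2})
g2 = ({"c", "x", "y"}, [("x", "y"), ("c", "x")], {"x": 1, "y": 2})
glued_vertices, glued_edges = glue(g1, g2)
print(len(glued_vertices), len(glued_edges))
```

In the example the two boundary vertices are non-adjacent in the first term but adjacent in the second, so they are adjacent in the sum.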

The boundaried gluing operation $\oplus_\delta$ is similar to the normal gluing operation, but results
in a $t$-boundaried graph rather than a graph. Specifically $G_1 \oplus_\delta G_2$ results in a $t$-boundaried
graph where the graph is $G = G_1 \oplus G_2$ and a vertex is in the boundary of $G$ if it was in the
boundary of $G_1$ or $G_2$. Vertices in the boundary of $G$ keep their label from $G_1$ or $G_2$. Both
for gluing and boundaried gluing we will refer to $G_1 \oplus G_2$ or $G_1 \oplus_\delta G_2$ as the sum of $G_1$ and
$G_2$, and $G_1$ and $G_2$ are the terms of the sum.

For a $t$-boundaried graph $G$ and boundary vertex $v \in \delta(G)$, forgetting $v$ results in a
t-boundaried graph identical to G, except that v is no longer a boundary vertex. All other 
boundary vertices keep their labels. Forgetting a non-boundary vertex leaves the graph 
unchanged, as does forgetting a vertex that is not in the vertex set of G. Forgetting a set 
$S \subseteq \delta(G)$ of vertices means forgetting all vertices in the set. The function forget($G$, $S$)
returns the t-boundaried graph resulting from forgetting S in G. 

We will frequently need to construct $t$-boundaried graphs from subgraphs of a graph $G$.
For a graph $G$ and two disjoint vertex sets $P$ and $B$ we define $G_P^B$ to be the $t$-boundaried
graph $G[P \cup B]$ with boundary $B$. The labelling of the border $B$ is chosen in a manner
independent of $P$ - such that if $P_1$, $P_2$ and $B$ are disjoint then $G_{P_1}^B \oplus_\delta G_{P_2}^B = G_{P_1 \cup P_2}^B$.
2.5 Monadic Second Order Logic (MSO)

The syntax of MSO on graphs includes the logical connectives $\vee$, $\wedge$, $\neg$, $\Leftrightarrow$, $\Rightarrow$, variables for
vertices, edges, sets of vertices and sets of edges, the quantifiers $\forall$, $\exists$ that can be applied to
these variables, and the following five binary relations:

1. $u \in U$ where $u$ is a vertex variable and $U$ is a vertex set variable;

2. $d \in D$ where $d$ is an edge variable and $D$ is an edge set variable;

3. inc(d,u), where d is an edge variable, u is a vertex variable, and the interpretation is 
that the edge d is incident on the vertex u; 

4. adj($u$, $v$), where $u$ and $v$ are vertex variables, and the interpretation is that $u$ and $v$
are adjacent; 

5. equality of variables representing vertices, edges, set of vertices and set of edges. 

Many common graph-theoretic notions such as vertex degree, connectivity, planarity,
being acyclic, and so on, can be expressed in MSO, as can be seen from introductory
expositions PES].

H minor-models. Recall that a $t$-boundaried graph $H$ is a minor of a $t$-boundaried graph
$G$ if (a $t$-boundaried graph isomorphic to) $H$ can be obtained from $G$ by deleting vertices or
edges or contracting edges, but never contracting edges with both endpoints being boundary
vertices. Let $V(H) = \{h_1, \ldots, h_c\}$, and let $B_G := \{b_1^G, \ldots, b_t^G\}$ and $B_H := \{b_1^H, \ldots, b_t^H\}$ denote
$\delta(G)$ and $\delta(H)$ respectively. Then, the formulation that $H \le_m G$ is given by $\varphi(G, H, B_G, B_H)$.

$$
\varphi(G, H, B_G, B_H) = \exists X_1, \ldots, X_c \subseteq V(G) \Big[
\bigwedge_{i \neq j} (X_i \cap X_j = \emptyset) \;\wedge\; \bigwedge_{1 \le i \le c} \mathrm{Conn}(G, X_i) \;\wedge
\bigwedge_{(h_i, h_j) \in E(H)} \exists x \in X_i \wedge y \in X_j \, [(x, y) \in E(G)] \;\wedge
\bigwedge_{b_i^H \in B_H} \exists x \in X_i \, [x = b_i^G]
\Big] \qquad (1)
$$

2.6 Finite Integer Index and Protrusions 

For a parameterized problem $\Pi$ and two $t$-boundaried graphs $G_1, G_2 \in \mathcal{G}$, we say that
$G_1 \equiv_\Pi G_2$ if there exists a constant $c$ such that for every $t$-boundaried graph $G$ and for every
integer $k$, $(G_1 \oplus G, k) \in \Pi$ if and only if $(G_2 \oplus G, k + c) \in \Pi$. For every $t$, the relation $\equiv_\Pi$
on $t$-boundaried graphs is an equivalence relation, and we call $\equiv_\Pi$ the canonical equivalence
relation of $\Pi$. We say that a problem $\Pi$ has Finite Integer Index if for every $t$, $\equiv_\Pi$ has finite
index on $t$-boundaried graphs. Thus, if $\Pi$ has finite integer index then for every $t$ there is a
finite set $\mathcal{S}$ of $t$-boundaried graphs such that for every $t$-boundaried graph $G_1$ there exists $G_2 \in \mathcal{S}$ such
that $G_2 \equiv_\Pi G_1$. Such a set $\mathcal{S}$ is called a set of representatives for $(\Pi, t)$. We will repeatedly
make use of the following proposition.

Proposition 2 ([8]). For every connected $\mathcal{F} \in \mathcal{P}$, $\mathcal{F}$-Deletion has finite integer index.
Protrusions and Protrusion Replacement. For a graph $G$ and $S \subseteq V(G)$, we define
$\partial_G(S)$ as the set of vertices in $S$ that have a neighbor in $V(G) \setminus S$. For a set $S \subseteq V(G)$
the neighbourhood of $S$ is $N_G(S) = \partial_G(V(G) \setminus S)$. When it is clear from the context, we
omit the subscripts. An $r$-protrusion in a graph $G$ is a set $X \subseteq V(G)$ such that $|\partial(X)| \le r$ and
$tw(G[X]) \le r$. If $G$ is a graph containing an $r$-protrusion $X$ and $X'$ is an $r$-boundaried graph,
the act of replacing $X$ by $X'$ means replacing $G$ by $G_{V(G) \setminus X}^{\partial(X)} \oplus X'$.
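
The boundary operator and the derived neighbourhood are easy to compute. The sketch below assumes an adjacency dictionary and hypothetical function names; it checks only the boundary-size condition of an $r$-protrusion and deliberately does not test the treewidth condition, which needs a separate treewidth algorithm.

```python
def boundary(adj, S):
    """The set of vertices of S that have a neighbour outside S."""
    S = set(S)
    return {v for v in S if adj[v] - S}


def neighbourhood(adj, S):
    """N(S), computed as the boundary of the complement of S."""
    return boundary(adj, set(adj) - set(S))


def has_small_boundary(adj, X, r):
    """First protrusion condition only: the boundary of X has at most r vertices."""
    return len(boundary(adj, X)) <= r


# Path 0-1-2-3-4 with X = {0, 1, 2}.
path_adj = {0: {1}, 1: {0, 2}, 2: {1, 3}, 3: {2, 4}, 4: {3}}
print(boundary(path_adj, {0, 1, 2}), neighbourhood(path_adj, {0, 1, 2}))
```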

A protrusion replacer for a parameterized graph problem $\Pi$ is a family of algorithms,
with one algorithm for every constant $r$. The $r$'th algorithm has the following specifications.
There exists a constant $r'$ (which depends on $r$) such that given an instance $(G, k)$ and an
$r$-protrusion $X$ in $G$ of size at least $r'$, the algorithm runs in time $O(|X|)$ and outputs an
instance $(G', k')$ such that $(G', k') \in \Pi$ if and only if $(G, k) \in \Pi$, $k' \le k$ and $G'$ is obtained from
$G$ by replacing $X$ by an $r$-boundaried graph $X'$ with less than $r'$ vertices. Observe that since
$X$ has at least $r'$ vertices and $X'$ has less than $r'$ vertices this implies that $|V(G')| < |V(G)|$.
The following proposition is the driving force of [8] and the starting point for our algorithms.

Proposition 3 ([8]). Every parameterized problem with finite integer index has a protrusion
replacer.

Together, Propositions 2 and 3 imply that for every connected $\mathcal{F} \in \mathcal{P}$, $\mathcal{F}$-Deletion has
a protrusion replacer.

2.6.1 Least Common Ancestor-Closure of Sets in Trees. 

For a rooted tree $T$ and vertex set $M$ in $V(T)$ the least common ancestor-closure
(LCA-closure) LCA-closure($M$) is obtained by the following process. Initially, set $M' = M$.
Then, as long as there are vertices $x$ and $y$ in $M'$ whose least common ancestor $w$ is not in
$M'$, add $w$ to $M'$. When the process terminates, output $M'$ as the LCA-closure of $M$. The
following folklore lemma summarizes two basic properties of LCA closures.

Lemma 1. Let $T$ be a tree, $M \subseteq V(T)$ and $M'$ = LCA-closure($M$). Then $|M'| \le 2|M|$
and for every connected component $C$ of $T \setminus M'$, $|N(C)| \le 2$.

Proof. To prove that $|M'| \le 2|M|$, make a tree $T'$ with vertex set $M'$ by adding, for every vertex
$v \in M'$, an edge to the lowermost proper ancestor of $v$ in $M'$ in the tree $T$. Observe that in
$T'$ all leaves are from $M$, since every vertex in $M' \setminus M$ is the least common ancestor of two
vertices below it in $T$. Furthermore, for the same reason every vertex in $M' \setminus M$ has at least
two children in $T'$. A standard counting argument for trees shows that the number of
vertices with at least two children is at most the number of leaves. Hence $|M' \setminus M| \le |M|$
and so $|M'| \le 2|M|$.

We now prove that $|N(C)| \le 2$. Suppose not, and let $r$ be the root of $C$. At most one of
$C$'s neighbours is the parent of $r$ and hence at least two of $C$'s neighbours, say $u$ and $v$, are
children of vertices in $C$. The vertices $u$ and $v$ are both in $M'$, and they are both descendants
of $r$. But then the least common ancestor of $u$ and $v$ must lie in $C$ and hence is not in $M'$,
contradicting the construction of $M'$. So we conclude that $|N(C)| \le 2$. □
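
The closure process described above can be run directly. The following is a deliberately naive sketch (not an efficient implementation), assuming the rooted tree is given by a hypothetical `parent` dictionary in which the root maps to `None`.

```python
from itertools import combinations


def lca(parent, x, y):
    """Least common ancestor of x and y in the rooted tree given by parent pointers."""
    ancestors = set()
    while x is not None:
        ancestors.add(x)
        x = parent[x]
    while y not in ancestors:
        y = parent[y]
    return y


def lca_closure(parent, M):
    """Repeatedly add missing least common ancestors until none is missing."""
    closure = set(M)
    changed = True
    while changed:
        changed = False
        for x, y in combinations(list(closure), 2):
            w = lca(parent, x, y)
            if w not in closure:
                closure.add(w)
                changed = True
                break  # restart the scan with the enlarged set
    return closure


# Complete binary tree on 7 vertices: 0 is the root, 1 and 2 its children,
# 3 and 4 the children of 1, 5 and 6 the children of 2.
parent = {0: None, 1: 0, 2: 0, 3: 1, 4: 1, 5: 2, 6: 2}
M = {3, 4, 5}
closed = lca_closure(parent, M)
print(sorted(closed))
```

In this example the only component of $T \setminus M'$ is $\{2, 6\}$, whose neighbours are $0$ and $5$, in line with the bound $|N(C)| \le 2$.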

3 A Randomized Algorithm for "connected" p-$\mathcal{F}$-Deletion

In this section we give a randomized algorithm for p-$\mathcal{F}$-Deletion when every graph in
$\mathcal{F} \in \mathcal{P}$ is connected. Recall that we call a family $\mathcal{F}$ connected if all the graphs in $\mathcal{F}$ are
connected. We will show that for every connected $\mathcal{F}$ the algorithm runs in polynomial time,
with the exponent of the polynomial depending on the family $\mathcal{F}$. If the input graph has an
$\mathcal{F}$-deletion set of size at most $k$, the algorithm will detect an $\mathcal{F}$-deletion set of size at most $k$
with probability at least $1/c^k$. Here the constant $c$ depends on $\mathcal{F}$. The algorithm has no false
positives - we show that if it reports that an $\mathcal{F}$-deletion set of size at most $k$ exists then $G$
indeed has such a set.

In the following sections we will progressively improve the algorithm; first we give an
implementation of the algorithm with expected running time $O(n \cdot OPT)$. Then we show
how to modify the (sped up) algorithm so that it not only decides whether $G$ has an $\mathcal{F}$-deletion
set of size at most $k$, but also outputs a solution. We show that if $G$ has an $\mathcal{F}$-deletion set of size
at most $k$, the algorithm will output a solution of size $k$ with probability at least $1/c^k$. We then
proceed to show that this algorithm in fact outputs constant factor approximate solutions
with constant probability, yielding a constant factor approximation for p-$\mathcal{F}$-Deletion for
connected $\mathcal{F}$ in expected $O(n \cdot OPT)$ time. The main structure of the improved algorithm
remains the same as the one described here.

The first building block of our algorithm is a simple algorithm to reduce the input instance 
to an equivalent instance that does not contain any large protrusions with small border. 

Lemma 2. For every $\mathcal{F} \in \mathcal{P}$ and constants $r$ and $r'$ such that p-$\mathcal{F}$-Deletion has a
protrusion replacer that reduces $r$-protrusions of size $r'$, there is an algorithm that takes as input
an instance $(G, k)$ of p-$\mathcal{F}$-Deletion, runs in $n^{O(r')}$ time and outputs an equivalent instance
$(G', k')$ such that $|V(G')| \le |V(G)|$, $k' \le k$ and $G'$ has no $r$-protrusion of size at least $r'$.

Proof. It is sufficient to give an $n^{O(r')}$ time algorithm to find an $r$-protrusion $X$ in $G$ of size at
least $r'$, if such a protrusion exists. If we had such an algorithm to find a protrusion we could
keep looking for $r$-protrusions $X$ in $G$ of size at least $r'$, and whenever one is found replace it
using the protrusion replacer. Since each replacement decreases the number of vertices by
at least one we converge to an instance $(G', k')$ with the desired properties after at most $n$ iterations.

To find an $r$-protrusion of size at least $r'$ observe that if such a protrusion exists, then
there must be at least one such protrusion $X$ such that $G[X \setminus \partial(X)]$ has at most $r'$ connected
components. Indeed, if $G[X \setminus \partial(X)]$ has more than $r'$ connected components then let $X'$ be
$\partial(X)$ plus the union of any $r'$ components of $G[X \setminus \partial(X)]$. Now $X'$ is an $r$-protrusion of size
at least $r'$ and $G[X' \setminus \partial(X')]$ has at most $r'$ components. To find an $r$-protrusion $X$ of size at
least $r'$ on at most $r'$ components, guess $\partial(X)$ and then guess which components of $G \setminus \partial(X)$
are in $X$. The size of the search space is bounded by $n^{r} \cdot n^{r'}$ and for each candidate $X$ we
can test whether it is a protrusion in linear time using Bodlaender's linear time treewidth
algorithm [6]. □

The second building block of our algorithm is a lemma whose proof we postpone until
the end of this section. The lemma states that for any $\mathcal{F} \in \mathscr{L}$, if $G$ contains no large
protrusions with small border then any feasible solution to $p$-$\mathcal{F}$-Deletion is incident to a
linear fraction of the edges of $G$. Recall that an $\alpha$-cover in $G$ is a set $S$ such that
$\sum_{v \in S} d(v) \ge \alpha \cdot \sum_{v \in V(G)} d(v) = 2\alpha \cdot m$.
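The $\alpha$-cover condition can be checked directly from the degree sequence. The following is a minimal sketch, assuming the graph is stored as a dictionary of adjacency sets; the name `is_alpha_cover` and the example graph are illustrative and not part of the paper.

```python
from typing import Dict, Hashable, Iterable, Set

# Assumed representation: adjacency sets of a simple undirected graph.
Graph = Dict[Hashable, Set[Hashable]]


def is_alpha_cover(graph: Graph, subset: Iterable[Hashable], alpha: float) -> bool:
    """Return True if the degrees in `subset` sum to at least alpha * 2m."""
    total_degree = sum(len(nbrs) for nbrs in graph.values())  # equals 2m
    cover_degree = sum(len(graph[v]) for v in set(subset))
    return cover_degree >= alpha * total_degree


# Star K_{1,3}: the centre 0 has degree 3 and the total degree is 6,
# so {0} is a 1/2-cover while a single leaf is not.
star = {0: {1, 2, 3}, 1: {0}, 2: {0}, 3: {0}}
```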

Lemma 3. For every $\mathcal{F} \in \mathscr{L}$ there exist constants $r$ and $\alpha$ such that if a graph $G$ has no
$r$-protrusion of size at least $r'$, then every $\mathcal{F}$-deletion set $S$ of $G$ is an $\alpha$-cover of $G$.

We now combine Lemmata 2 and 3 to give a randomized algorithm for $p$-$\mathcal{F}$-Deletion
for all $\mathcal{F} \in \mathscr{L}$ such that each graph in $\mathcal{F}$ is connected.

Lemma 4. Algorithm 2 runs in polynomial time. If $(G,k)$ is a "no" instance it outputs
"no", and if $(G,k)$ is a "yes" instance it outputs "yes" with probability at least $1/c^k$, where $c$ is
a constant depending only on $\mathcal{F}$.






Randomized-FPT-beta($(G,k)$)

Set $G_{current} := G$ and $k_{current} := k$.

While ($G_{current}$ is not $\mathcal{F}$-free) do as follows:

1. If $k_{current} \le 0$, return that $G$ does not have a $k$-sized $\mathcal{F}$-deletion set.

2. Apply Lemma 2 on $(G_{current}, k_{current})$ and obtain an equivalent instance $(G', k')$.

3. Pick a vertex $u \in V(G')$ at random with probability $d(u)/2m$, where $m = |E(G')|$. Set $G_{current} := G' \setminus \{u\}$
and $k_{current} := k' - 1$.

Return that $G$ has a $k$-sized $\mathcal{F}$-deletion set.



Figure 2: In Algorithm Randomized-FPT-beta, let $r$ be the constant as guaranteed by
Lemma 3 and let $r'$ be the smallest integer such that the protrusion replacer for $p$-$\mathcal{F}$-Deletion
reduces $r$-protrusions of size $r'$.
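To make the sampling step concrete, here is a minimal sketch of the loop for the special case $\mathcal{F} = \{K_2\}$ (Vertex Cover), where "$\mathcal{F}$-free" means edgeless. This is an illustration under stated assumptions, not the general algorithm: the protrusion-reduction step is omitted, which is harmless in this special case because every vertex cover touches every edge and is therefore a $1/2$-cover. A vertex is drawn with probability $d(u)/2m$ by picking a uniform edge and then a uniform endpoint. All function names are hypothetical.

```python
import random
from typing import List, Optional, Set, Tuple

Edge = Tuple[int, int]


def pick_degree_proportional(edges: List[Edge], rng: random.Random) -> int:
    """Pick u with probability d(u) / 2m: uniform edge, then uniform endpoint."""
    u, v = rng.choice(edges)
    return u if rng.random() < 0.5 else v


def one_run(edges: List[Edge], k: int, rng: random.Random) -> Optional[Set[int]]:
    """One run of the loop for F = {K_2}; a cover of size <= k, or None for "no"."""
    remaining = list(edges)
    chosen: Set[int] = set()
    while remaining:                      # graph is not yet edgeless
        if k <= 0:
            return None                   # budget exhausted: this run says "no"
        u = pick_degree_proportional(remaining, rng)
        chosen.add(u)                     # put u into the solution
        remaining = [e for e in remaining if u not in e]
        k -= 1
    return chosen


def repeat_runs(edges: List[Edge], k: int, runs: int, seed: int = 0) -> Optional[Set[int]]:
    """Repeat the one-sided-error run to boost the success probability."""
    rng = random.Random(seed)
    for _ in range(runs):
        cover = one_run(edges, k, rng)
        if cover is not None:
            return cover
    return None
```

A run never reports a solution that does not exist, so only "yes" instances can be misclassified; repeating the run drives that error down.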



Proof. Since each iteration runs in polynomial time and reduces the number of vertices in
$G_{current}$ by at least one, Algorithm 2 runs in polynomial time. Furthermore, Step 2 reduces
the instance to an equivalent instance with $k' \le k_{current}$ and Step 3 only decreases $k_{current}$
when it puts a vertex into the solution. Hence when the algorithm outputs "yes" then a
$k$-sized $\mathcal{F}$-deletion set exists. It remains to show the last part of the statement.

We say that an iteration of Step 3 is successful if there exists an $\mathcal{F}$-deletion set $S$ of $G'$
with $|S| \le k'$ such that the vertex $u$ selected in this step is in $S$. If the step is successful
then $S \setminus \{u\}$ is an $\mathcal{F}$-deletion set of $G' \setminus \{u\}$ of size at most $k' - 1$. Thus, if the input graph $G$
has a $k$-sized $\mathcal{F}$-deletion set and all the iterations of Step 3 are successful then the algorithm
maintains the invariant that $G_{current}$ has an $\mathcal{F}$-deletion set of size at most $k_{current}$, and thus
after at most $k$ iterations it terminates and outputs that $(G,k)$ is a "yes" instance. When
Step 3 is executed the graph $G'$ has no $r$-protrusions of size at least $r'$. Thus by Lemma 3
every $\mathcal{F}$-deletion set of $G'$ is an $\alpha$-cover for a constant $\alpha$ depending only on $\mathcal{F}$. Hence the
probability that $u$ is in a minimum size $\mathcal{F}$-deletion set of $G'$ is at least $\alpha$. We conclude that
the probability that the first $k$ executions of Step 3 are successful is at least $\alpha^k = (1/c)^k$ for $c = 1/\alpha$, concluding
the proof. □

Repeating the algorithm presented in Figure 2 $O(c^k)$ times yields an $O(2^{O(k)} n^{O(1)})$ time
algorithm for $p$-$\mathcal{F}$-Deletion for all connected $\mathcal{F} \in \mathscr{L}$. However we are not entirely done
with the proof of Lemma 4, as it remains to prove Lemma 3. In order to complete the proof
we need to define protrusion decompositions.

3.1 Protrusion Decompositions and Proof of Lemma 3

We recall the notion of a protrusion decomposition defined in [8] and show that if a graph
$G$ has a set $X$ such that $tw(G \setminus X) \le d$, then it admits a protrusion decomposition for an
appropriate value of the parameters. We then use this result to prove Lemma 3.

Definition 1. [Protrusion Decomposition] [8] A graph $G$ has an $(\alpha, \beta)$-protrusion
decomposition if $V(G)$ has a partition $\mathcal{P} = \{R_0, R_1, \ldots, R_t\}$ where

• $\max\{t, |R_0|\} \le \alpha$,

• each $N_G[R_i]$, $i \in \{1, \ldots, t\}$, is a $\beta$-protrusion of $G$, and

• for all $i \ge 1$, $N(R_i) \subseteq R_0$.

We call the sets $R_i^+ = N_G[R_i]$, $i \in \{1, \ldots, t\}$, protrusions of $\mathcal{P}$.

We now show that for every $\mathcal{F} \in \mathscr{L}$ every graph with an $\mathcal{F}$-deletion set $X$ has an $(\alpha, \beta)$-protrusion
decomposition where $\beta$ is constant and $\alpha = O(|N[X]|)$.

Lemma 5 (Protrusion Decomposition Lemma). If an $n$-vertex graph $G$ has a vertex subset
$X$ such that $tw(G \setminus X) \le b$, then $G$ admits a $(4|N[X]|(b+1), 2(b+1))$-protrusion decomposition.
Furthermore, if we are given the set $X$ then this protrusion decomposition can be
computed in linear time. Here $b$ is a constant.

Proof. We give a proof for the case when $X$ is explicitly given to us. The proof
automatically implies the existence of a $(4(b+1)|N(X)|, 2b+2)$-protrusion decomposition of $G$ for
the case when we are just guaranteed the existence of $X$. The algorithm starts by computing
a nice tree decomposition $(T, \mathcal{B})$ of $G \setminus X$ with width at most $b$. Notice that since $b$ is a
constant this can be done in linear time [6].

For every $v \in N(X)$ add a node $u$ in $T$ such that $v \in B_u$ to a set $M'$. We have that
$|M'| \le |N(X)|$. Let $M'$ be the set of marked nodes and set $M = \text{LCA-closure}(M')$. By
Lemma 1, $|M| \le 2|M'| \le 2|N(X)|$. Let $Q_1, Q_2, \ldots, Q_t$ be the connected components of $T \setminus M$.
Since $T$ is a binary tree, $T \setminus M$ has at most $2|M| + 1$ connected components, so $t \le 4|N(X)| + 1$.
By Lemma 1 we have that for every $i \le t$, $|N_T(Q_i)| \le 2$.

Define $R_0 = X \cup \bigcup_{u \in M} B_u$ and for each $1 \le i \le t$ set $R_i = \bigcup_{u \in Q_i} B_u \setminus R_0$. Since every
vertex of $G \setminus X$ appears in a bag of the tree-decomposition, $R_0, \ldots, R_t$ forms a partition of
$V(G)$. By construction we have that for every $i \ge 1$, $N(R_i) \subseteq R_0$ and $tw(G[N[R_i]]) \le b$.
Furthermore, since $|N_T(Q_i)| \le 2$ we have $|N(R_i)| \le 2(b+1)$. Thus $R_0, \ldots, R_t$ form an $(\alpha, \beta)$-protrusion
decomposition of $G$ where $\beta \le 2(b+1)$ and $\alpha \le \max(|R_0|, t) \le 4|N[X]|(b+1)$.
It is easy to implement a procedure that computes $R_0, \ldots, R_t$ in this way in linear time. □
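The marking step of the proof relies on closing the marked nodes under least common ancestors. The following is a minimal sketch of such a closure on a rooted tree given as a parent map; the representation and the names `ancestors`, `lca` and `lca_closure` are assumptions made for illustration, and the quadratic loop is chosen for clarity rather than for the linear running time claimed in the lemma.

```python
from itertools import combinations
from typing import Dict, Hashable, List, Optional, Set

# Assumed representation: parent[v] is the parent of v, and None for the root.
ParentMap = Dict[Hashable, Optional[Hashable]]


def ancestors(parent: ParentMap, v: Hashable) -> List[Hashable]:
    """Return the path v, parent(v), ..., root."""
    path = [v]
    while parent[v] is not None:
        v = parent[v]
        path.append(v)
    return path


def lca(parent: ParentMap, u: Hashable, v: Hashable) -> Hashable:
    """Least common ancestor of u and v in the rooted tree."""
    seen = set(ancestors(parent, u))
    for w in ancestors(parent, v):
        if w in seen:
            return w
    raise ValueError("nodes are not in the same rooted tree")


def lca_closure(parent: ParentMap, marked: Set[Hashable]) -> Set[Hashable]:
    """Smallest superset of `marked` closed under taking pairwise LCAs."""
    closure = set(marked)
    changed = True
    while changed:
        changed = False
        for u, v in combinations(list(closure), 2):
            w = lca(parent, u, v)
            if w not in closure:
                closure.add(w)
                changed = True
    return closure
```

On a complete binary tree with root 0, marking the leaves 3, 4 and 5 adds the internal nodes 1 and 0, which stays within the factor-two bound used in the proof.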

We are now in a position to prove Lemma 3.

Proof of Lemma 3. We need to prove that for every $\mathcal{F} \in \mathscr{L}$ there exist constants $r$ and $\alpha$
such that if a graph $G$ has no $r$-protrusion of size at least $r'$, then every minimal $\mathcal{F}$-deletion
set $S$ of $G$ is an $\alpha$-cover of $G$. By Proposition 1 there exists a constant $\eta$ depending only on
$\mathcal{F}$ such that $tw(G \setminus S) \le \eta$. By Lemma 5, $G$ has a $(4|N[S]|(\eta+1), 2(\eta+1))$-protrusion
decomposition $R_0, \ldots, R_t$. Set $r = 2(\eta+1)$ and suppose $G$ has no $r$-protrusions of size at least
$r'$. Then $t \le 4|N[S]|(\eta+1)$, $|R_0| \le 4|N[S]|(\eta+1)$ and so $|V(G)| = |R_0| + \sum_{1 \le i \le t} |R_i| \le
4|N[S]|(\eta+1)(r'+1) \le 8|N[S]|(\eta+1)r'$. Since $tw(G \setminus S) \le \eta + 1$ it follows that $G \setminus S$
is (7? + l)-degenerate and so J2veV(G)\s d ( v ) ^ ( 8 l Ar [ 5 ']l)( r / + 1 ) 2 ' r '- Set « = is^+i) 2 and 
observe that 

\[ \sum_{v \in V(G)} d(v) \le \sum_{v \in S} d(v) + \sum_{v \in V(G) \setminus S} d(v) \le \sum_{v \in S} d(v) + 8|N[S]|(\eta+1)^2 r' \le \frac{1}{\alpha} \sum_{v \in S} d(v). \]

The last inequality follows from the fact that there are no isolated vertices in $S$. □

4 Fast Protrusion Replacement 

What makes the polynomial factor of Algorithm 2 large is the algorithm of Lemma 2 to
remove all large enough protrusions with small border size. In this section we give much
faster algorithms that reduce "almost all" large protrusions with small border. We then
show that reducing almost all protrusions instead of all protrusions is sufficient to obtain the
conclusion of Lemma 3. The "fast protrusion reduction" algorithms we design in this section
are applicable to any problem that uses a protrusion reducer, and hence they are useful well
beyond the scope of this paper. We give two algorithms for fast protrusion replacement, a
randomized algorithm and a slightly slower deterministic algorithm.

The Randomized Fast Protrusion Replacer. We now describe an algorithm that we
call the Randomized Fast Protrusion Replacer (RFPR). The algorithm works for parameterized
graph problems $\Pi$ that have a protrusion replacer, takes as input an instance $(G,k)$ and
outputs another instance $(G',k')$. Just as a normal protrusion replacer, the RFPR is actually
a family of algorithms with one algorithm for each value of the integer $r$. We describe how
the algorithm proceeds for a fixed value of $r$. Let $r'$ be the smallest integer such that the
protrusion replacer for $\Pi$ replaces $r$-protrusions of size at least $r'$.

The RFPR proceeds as follows. We select a random partition of $V(G)$ into $r+1$ sets
$X_1, X_2, \ldots, X_{r+1}$. For every $i \le r+1$ we compute the connected components of $G[X_i]$ and
add these components to a collection $\mathcal{C}'$. This results in a partition of $V(G)$ into $\mathcal{C}' =
C'_1, C'_2, \ldots, C'_{t'}$. Now, discard every component $C'_i$ such that $|N(C'_i)| > r$ and every component
$C'_i$ such that $tw(G[N[C'_i]]) > r$. Discarding all of these components can be done in linear time:
the only computationally hard step is to check whether the treewidth of the components
is at most $r$, and this can be done in linear time using Bodlaender's algorithm [6]. Let $\mathcal{C}^* =
C^*_1, \ldots, C^*_{t^*}$ be the remaining components.
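The partitioning step can be sketched as follows, assuming the graph is a dictionary of adjacency sets. The sketch colours the vertices uniformly at random, extracts the monochromatic connected components, and keeps those with at most $r$ neighbours. The treewidth test is deliberately left out (it would require a treewidth routine such as Bodlaender's algorithm), so this covers only part of the filtering; all names are hypothetical.

```python
import random
from typing import Dict, List, Set

Graph = Dict[int, Set[int]]


def monochromatic_components(graph: Graph, colour: Dict[int, int]) -> List[Set[int]]:
    """Connected components of G[X_1], ..., G[X_{r+1}] for the colour classes X_i."""
    seen: Set[int] = set()
    components: List[Set[int]] = []
    for start in graph:
        if start in seen:
            continue
        seen.add(start)
        comp, stack = {start}, [start]
        while stack:
            v = stack.pop()
            for w in graph[v]:
                # Only follow edges that stay inside the colour class of `start`.
                if w not in seen and colour[w] == colour[start]:
                    seen.add(w)
                    comp.add(w)
                    stack.append(w)
        components.append(comp)
    return components


def open_neighbourhood(graph: Graph, comp: Set[int]) -> Set[int]:
    """N(C): vertices outside C adjacent to some vertex of C."""
    return {w for v in comp for w in graph[v]} - comp


def candidate_components(graph: Graph, r: int, rng: random.Random) -> List[Set[int]]:
    """Random (r+1)-colouring, then keep components with at most r neighbours.

    The treewidth filter tw(G[N[C]]) <= r is omitted in this sketch.
    """
    colour = {v: rng.randrange(r + 1) for v in graph}
    comps = monochromatic_components(graph, colour)
    return [c for c in comps if len(open_neighbourhood(graph, c)) <= r]
```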

For every $C^* \in \mathcal{C}^*$, $N[C^*]$ is an $r$-protrusion in $G$. However some, if not all, of the components
in $\mathcal{C}^*$ could have fewer than $r'$ vertices and so the protrusion replacer cannot reduce them.
It could nevertheless be possible to group some components in $\mathcal{C}^*$ with the same neighbourhood
together such that their union is a protrusion that is large enough to be reduced. From $\mathcal{C}^*$
we will compute a collection $\mathcal{R}$ of disjoint vertex sets such that for every $R \in \mathcal{R}$, $N[R]$ is an
$r$-protrusion in $G$ of size at least $r'$. Our aim is to compute such a set with $|\mathcal{R}|$ being large.
For every component $C^* \in \mathcal{C}^*$ of size at least $r'$ we add $C^*$ to $\mathcal{R}$ and remove $C^*$ from $\mathcal{C}^*$. Let
$\mathcal{C} = C_1, \ldots, C_t$ be the remaining components. All components in $\mathcal{C}$ have size at most $r'$. Set
$R_{big}$ to be the number of components $C^* \in \mathcal{C}^*$ on at least $r'$ vertices that are added to $\mathcal{R}$.

Now we partition $\mathcal{C}$ into groups according to the neighbourhood of the components.
Specifically we compute a partition of $\mathcal{C}$ into $Z_1, \ldots, Z_q$ such that for every pair $C_i \in \mathcal{C}$,
$C_{i'} \in \mathcal{C}$ such that $N(C_i) = N(C_{i'})$, $C_i$ and $C_{i'}$ are in the same $Z_j$, while for every pair
$C_i \in \mathcal{C}$, $C_{i'} \in \mathcal{C}$ such that $N(C_i) \ne N(C_{i'})$ we have $C_i \in Z_j \rightarrow C_{i'} \notin Z_j$. Such a partition can
be computed in time $O(nr)$ because every component in $\mathcal{C}$ has at most $r$ neighbours. First
we sort the neighbour lists of each component according to some ordering of the vertex set,
for example an arbitrary labelling of the vertices from 1 to $n$. Then we do $r$ stable bucket
sorts on $\mathcal{C}$, sorting the components first on their first neighbour, then their second neighbour,
etc.
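The grouping by neighbourhood can be sketched as below. Note the swapped-in technique: instead of the $r$ stable bucket sorts described above, this sketch hashes each component on its sorted neighbour tuple, which gives the same partition in expected (not worst-case) linear time. The graph representation and the name `group_by_neighbourhood` are assumptions.

```python
from collections import defaultdict
from typing import Dict, FrozenSet, Iterable, List, Set, Tuple

Graph = Dict[int, Set[int]]


def group_by_neighbourhood(
    graph: Graph, components: Iterable[Set[int]]
) -> List[List[FrozenSet[int]]]:
    """Partition components into classes Z_1, ..., Z_q with equal N(C)."""
    groups: Dict[Tuple[int, ...], List[FrozenSet[int]]] = defaultdict(list)
    for comp in components:
        frozen = frozenset(comp)
        nbrs = {w for v in frozen for w in graph[v]} - frozen
        # The sorted tuple is a canonical key for the neighbourhood.
        groups[tuple(sorted(nbrs))].append(frozen)
    return list(groups.values())
```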

Having computed the partitioning $Z_1, \ldots, Z_q$ we now compute $\mathcal{R}$ as follows. As long
as there is a $Z_i$ such that $\sum_{C \in Z_i} |C| \ge r'$, select a minimal collection $Z \subseteq Z_i$ such that
$\sum_{C \in Z} |C| \ge r'$. Add $\bigcup_{C \in Z} C$ to $\mathcal{R}$ and remove the components of $Z$ from $Z_i$. This
procedure can easily be implemented in linear time. This concludes the construction of $\mathcal{R}$.
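One simple way to realise this packing step within a single class $Z_i$ is a left-to-right scan that closes a group as soon as its total size reaches $r'$. This is a sketch under assumptions: the scan produces a prefix that is not necessarily inclusion-minimal, but when every component has fewer than $r'$ vertices each group still has total size at most $2(r'-1)$, which is the bound used in the analysis. The name `pack_groups` is hypothetical and components are represented only by their sizes.

```python
from typing import List, Tuple


def pack_groups(sizes: List[int], r_prime: int) -> Tuple[List[List[int]], List[int]]:
    """Greedily pack component indices of one class Z_i into groups of size >= r'.

    `sizes[j]` is |C_j|; every size is assumed to be smaller than r_prime.
    Returns (packed groups, leftover indices); the leftover total is < r_prime.
    """
    packed: List[List[int]] = []
    current: List[int] = []
    total = 0
    for idx, size in enumerate(sizes):
        current.append(idx)
        total += size
        if total >= r_prime:          # group is large enough to be reduced
            packed.append(current)
            current, total = [], 0
    return packed, current
```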

Given $\mathcal{R}$ we proceed as follows. For a set $R \in \mathcal{R}$ we run the protrusion replacer for $\Pi$
on $(G,k)$ with protrusion $N[R]$. The protrusion replacer outputs an equivalent instance
$(G^*, k^*)$ with $|V(G^*)| < |V(G)|$. Here $G^*$ is a graph where $R$ has been replaced by a smaller
protrusion $R'$. Since all the sets in $\mathcal{R}$ are disjoint, the other sets in $\mathcal{R}$ are now $r$-protrusions
in $G^*$ of size at least $r'$. Thus we can run the protrusion replacer on all the sets in $\mathcal{R}$. This
takes time $\sum_{R \in \mathcal{R}} O(|R|) = O(n)$. Let $(G',k')$ be the instance obtained after running the
protrusion replacer on all the sets in $\mathcal{R}$. The RFPR outputs the instance $(G',k')$. We collect
a few simple facts about the RFPR in the following lemma.



Lemma 6. Given an instance $(G,k)$, the RFPR runs in time $O(n+m)$, computes a collection
$\mathcal{R}$ of protrusions and outputs an equivalent instance $(G',k')$ such that $|V(G')| \le |V(G)| - |\mathcal{R}|$.
Furthermore
\[ |\mathcal{R}| \ge R_{big} + \sum_{i \le q} \frac{\sum_{C \in Z_i} |C| - (r'-1)}{2(r'-1)}. \]

Proof. The instances $(G,k)$ and $(G',k')$ are equivalent because $(G',k')$ is obtained from $(G,k)$
by repeated applications of a protrusion replacer. In the description of the algorithm we
made sure that each individual stage of the algorithm runs in linear time. Finally, each
application of the protrusion replacer reduces the size of the graph by at least one. We apply
the protrusion replacer $|\mathcal{R}|$ times. Hence $|V(G')| \le |V(G)| - |\mathcal{R}|$.

Finally, when the RFPR selects a minimal collection $Z \subseteq Z_i$ such that $\sum_{C \in Z} |C| \ge r'$,
since each $C_j \in Z_i$ has size at most $r'$ it follows that $\sum_{C \in Z} |C| \le 2(r'-1)$. Thus every
time we add a set to $\mathcal{R}$, $\sum_{C \in Z_i} |C|$ decreases by at most $2(r'-1)$. At the end, when we
cannot add more sets to $\mathcal{R}$, we have that for every $i$, $\sum_{C \in Z_i} |C| < r'$. This proves the last part
of the statement of the lemma. □

Analyzing the Randomized Fast Protrusion Replacer. We now analyze how many
vertices the Fast Protrusion Replacer reduces the instance by. To that end we need to define
the notion of protrusion covers.

Definition 2. An $(a, b, r)$-protrusion cover in a graph $G$ is a collection $\mathcal{Z} = Z_1, \ldots, Z_t$ of
sets such that for every $i$, $N[Z_i]$ is an $r$-protrusion in $G$ and $a \le |Z_i| \le b$, and for every $i \ne j$,
$Z_i \cap Z_j = \emptyset$ and there are no edges from $Z_i$ to $Z_j$. The size of $\mathcal{Z}$ is $|\mathcal{Z}|$.

Lemma 7. Let $\Pi$ be a problem that has a protrusion replacer which replaces $r$-protrusions
of size at least $r'$, and let $s \ge r' \cdot 2^r$. If $G$ is a graph with an $(s, 6s, r)$-protrusion cover $\mathcal{X}$,
then if the RFPR is run on $(G,k)$, with probability at least $1 - e^{-\frac{|\mathcal{X}|}{8(r+1)^{6s}}}$ the output instance
$(G',k')$ satisfies $|V(G)| - |V(G')| \ge \frac{|\mathcal{X}|}{4(r+1)^{6s}}$.

Proof. By Lemma 6 the RFPR computes a set $\mathcal{R}$ of protrusions and $|V(G)| - |V(G')| \ge |\mathcal{R}|$.
Thus it is sufficient to show that with high probability, $|\mathcal{R}| \ge \frac{|\mathcal{X}|}{4(r+1)^{6s}}$. Define $\bar{X} = V(G) \setminus
\bigcup_{X \in \mathcal{X}} X$. Since no edge goes between different sets in $\mathcal{X}$ we have that for every $X \in \mathcal{X}$,
$N(X) \subseteq \bar{X}$. The only randomized step of the RFPR is the initial partitioning of $V(G)$ into
sets $X_1, \ldots, X_{r+1}$. We may think of this partitioning step as selecting a random coloring of
$V(G)$ with colors from $\{1, \ldots, r+1\}$.

We say that a set $X$ in $\mathcal{X}$ succeeds if all vertices in $X$ are colored with the same color,
and no vertex of $N(X)$ is colored with that color. Since every set $X \in \mathcal{X}$ has at most $r$
neighbours we have that the probability that $X$ succeeds given any coloring of $\bar{X}$ is at least
$\frac{1}{(r+1)^{|X|}}$. Hence the expected number of sets $X \in \mathcal{X}$ that succeed is at least $\frac{|\mathcal{X}|}{(r+1)^{6s}}$. Suppose
$t$ sets succeed. We prove that the set $\mathcal{R}$ constructed by the Randomized Fast Protrusion
Replacer has size at least $t/2$.

For each set $X$ that succeeds, the connected components of $X$ are added to $\mathcal{C}'$, and since
they all have treewidth at most $r$ and have at most $r$ neighbors, none of them are discarded.
Hence the connected components of $X$ are all added to $\mathcal{C}^*$. Since $|X| \ge r' \cdot 2^r$, if we group
the connected components of $X$ by their neighbourhood, at least one group has combined
size at least $r'$. If this group contains a connected component on at least $r'$ vertices then
this component is added to $\mathcal{R}$ directly and $X$ contributes one to $R_{big}$. If this group does
not contain any components of size at least $r'$ then the group is added in its entirety to
some set $Z_i$. In this case the group contributes at least $r'$ to $\sum_{C \in Z_i} |C|$. By Lemma 6,
\[ |\mathcal{R}| \ge R_{big} + \sum_{i \le q} \frac{\sum_{C \in Z_i} |C| - (r'-1)}{2(r'-1)}. \]
Hence the total number of sets added to $\mathcal{R}$ is at least $t/2$.

Since the neighbourhoods of different sets in $\mathcal{X}$ may overlap there are dependencies
between which sets succeed. However, given any coloring of $\bar{X}$ the success of different sets in
$\mathcal{X}$ is independent, since whether $X$ succeeds or not depends only on the color of vertices in
$X$ and the color of vertices in $N(X) \subseteq \bar{X}$. Thus for every coloring of $\bar{X}$ the number of sets
that succeed is a sum of independent 0-1 variables taking value 1 with probability at least
$\frac{1}{(r+1)^{|X|}} \ge \frac{1}{(r+1)^{6s}}$. Standard Chernoff bounds for the binomial distribution show that if $T$ is a sum
of $n$ independent 0-1 variables taking value 1 with probability $p$, then $P[T < np/2] \le e^{-np/8}$.
Plugging this in for the number of sets in $\mathcal{X}$ that succeed yields that the probability that
$|\mathcal{R}| < \frac{|\mathcal{X}|}{4(r+1)^{6s}}$ is at most $e^{-\frac{|\mathcal{X}|}{8(r+1)^{6s}}}$. □

The Deterministic Fast Protrusion Replacer. We prove that the RFPR can be made
deterministic at the cost of a $\log n$ factor in the running time. The only randomized step of
the RFPR is the initial step where the vertices of $G$ are partitioned into $r+1$ sets $X_1, \ldots, X_{r+1}$.
We may think of this partitioning step as selecting a random coloring of $V(G)$ with colors
from $\{1, \ldots, r+1\}$. The main difference between the randomized and the deterministic
Fast Protrusion Replacer is how this coloring is chosen. The Deterministic Fast Protrusion
Replacer only partitions $V(G)$ into two sets $X_1$ and $X_2$; this corresponds to coloring the
vertices with colors 1 and 2. To describe the colorings that the Deterministic Fast Protrusion
Replacer (DFPR) uses, we need the notion of universal sets.

Definition 2 ([35]). An $(n, t)$-universal set $\mathcal{V}$ of a ground set $U$ on $n$ elements is a collection
$\mathcal{V}$ of subsets of $U$ such that for every set $S \subseteq U$ of size $t$ and set $S' \subseteq S$ there is a set $P \in \mathcal{V}$ such
that $P \cap S = S'$.
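For small ground sets the universality condition can be verified by brute force. The sketch below is illustrative only: it checks every $S$ of size exactly $t$ and tests whether all $2^t$ subsets of $S$ occur as a trace $P \cap S$ (traces on smaller sets then follow by restriction). It runs in exponential time and is unrelated to the efficient construction of [35]; the name `is_universal` is hypothetical.

```python
from itertools import combinations
from typing import Iterable, Sequence, Set


def is_universal(family: Iterable[Set[int]], ground: Sequence[int], t: int) -> bool:
    """Brute-force check that `family` is an (n, t)-universal set over `ground`."""
    frozen_family = [frozenset(p) for p in family]
    for chosen in combinations(ground, t):
        s = frozenset(chosen)
        traces = {p & s for p in frozen_family}   # all distinct P ∩ S
        if len(traces) < 2 ** t:                  # some S' ⊆ S is missed
            return False
    return True
```

For example, over the ground set {0, 1, 2} with t = 2, the five sets ∅, {0}, {1}, {2}, {0, 1, 2} already form a universal set, while the two sets ∅ and {0, 1, 2} do not.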

Theorem 4 ([35]). There is a deterministic algorithm with running time $O(2^{t+o(t)} n \log n)$
that constructs an $(n,t)$-universal set $\mathcal{V}$ such that $|\mathcal{V}| = 2^{t+o(t)} \log n$.

The DFPR has two parameters, $r$ and $s$, instead of just one parameter $r$. It constructs an
$(n, 6s+r)$-universal set $\mathcal{V}$ in time $O(2^{6s+r+o(6s+r)} n \log n) = O(2^{20s} n \log n)$ and selects the first
set $P \in \mathcal{V}$. It sets $X_1 = P$, $X_2 = V(G) \setminus P$ and then it proceeds just as the RFPR would. For a
fixed set $P \in \mathcal{V}$ this takes linear time and will reduce $(G,k)$ to an equivalent instance $(G',k')$.
Choosing different sets $P \in \mathcal{V}$ results in different output instances $(G',k')$. The DFPR tries
all possible choices for $P \in \mathcal{V}$ and then finally outputs the instance $(G',k')$ that maximizes
$|V(G)| - |V(G')|$. The total time taken by the DFPR is $O(2^{20s} n \log n) + |\mathcal{V}| \cdot O(n+m) =
O(2^{20s}(n+m) \log n)$. This proves the following lemma.

Lemma 8. Given an instance $(G,k)$, the DFPR runs in time $O(2^{20s}(n+m) \log n)$, computes
a collection $\mathcal{R}$ of protrusions and outputs an equivalent instance $(G',k')$ such that $|V(G')| \le
|V(G)| - |\mathcal{R}|$.

We now give a lemma analogous to Lemma [7] for the DFPR. 

Lemma 9. Let $\Pi$ be a problem that has a protrusion replacer which replaces $r$-protrusions of
size at least $r'$, and let $s \ge r' \cdot 2^r$. If $G$ is a graph with an $(s, 6s, r)$-protrusion cover $\mathcal{X}$, then if
the DFPR is run on $(G,k)$, the output instance $(G',k')$ satisfies $|V(G)| - |V(G')| \ge \frac{|\mathcal{X}|}{2^{20s} \log n}$.

Proof. In the proof of Lemma 7 we showed that $|V(G)| - |V(G')|$ is lower bounded in terms of the
number of sets that succeed. Since each set $X \in \mathcal{X}$ has size at most $6s$ and $|N[X]| \le 6s + r \le
7s$ it follows that for every $X \in \mathcal{X}$ there is some coloring set $P \in \mathcal{V}$ that makes $X$ succeed.
Hence there is a coloring $P \in \mathcal{V}$ that makes at least $\frac{|\mathcal{X}|}{|\mathcal{V}|} \ge \frac{|\mathcal{X}|}{2^{r+6s+o(r+6s)} \log n}$ sets succeed. In the
proof of Lemma 7 we showed that $|V(G)| - |V(G')|$ is at least half the number of succeeding
sets. Since $2 \cdot 2^{r+6s+o(r+6s)} \le 2^{20s}$ we have $|V(G)| - |V(G')| \ge \frac{|\mathcal{X}|}{2^{20s} \log n}$. □

We now proceed to prove that if G has a protrusion decomposition such that a linear 
fraction of the vertices appear in large enough r-protrusions then with high probability the 
Randomized Fast Protrusion Replacer will reduce G by a linear fraction of its vertices. To 
that end we need to have a closer look at the relationship between protrusion decompositions 
and protrusion covers. 

Protrusion Covers from Protrusion Decompositions. First we prove that in a graph 
of small treewidth we can always find protrusion covers with large size. 

Lemma 10. There exists a constant $c$ such that for any integers $n > s > b \ge 2$ and $n$-vertex
graph $G$ of treewidth $b$, $G$ has an $(s, 6s, 2(b+1))$-protrusion cover of size at least $\frac{n}{122s}$.

Proof. Let $(T, \mathcal{B})$ be a nice tree-decomposition of $G$ of width $b$. For a subset $Q \subseteq V(T)$, by
$P(Q)$ we denote $\bigcup_{q \in Q} B_q$. For a rooted tree $T$ and a vertex $v \in T$, a component $C$ of $T \setminus \{v\}$
is said to be below $v$ if all vertices of $C$ are descendants of $v$ in $T$. We start by constructing
a set $S \subseteq V(T)$ and a collection $Q_1, \ldots, Q_{|S|}$ of connected components of $T \setminus S$ using the
following greedy procedure.

Let $r$ be the root of $T$. In the beginning $S = \emptyset$ and $T_r = T$. We maintain a loop invariant
that $T_r$ is the connected component of $T \setminus S$ that contains $r$. Now, at step $i$ of the greedy
procedure we pick a lowermost vertex $v_i$ in $V(T_r)$ such that there is a connected component
$Q_i$ of $T_r \setminus \{v_i\}$ below $v_i$ such that $|P(Q_i)| \ge 3s + 7(b+1)$. Now we add $v_i$ to $S$ and update
$T_r$ accordingly. The procedure terminates when no vertex $v$ in $T_r$ has this property. In
particular, if for every $v \in T_r$, every component $Q$ of $T_r \setminus \{v\}$ below $v$ satisfies $|P(Q)| < 3s + 7(b+1)$,
the procedure terminates. Since $(T, \mathcal{B})$ is a nice tree decomposition, we have that for any
vertex $v \in T_r$ and parent $u$ of $v$, if $C_v$ and $C_u$ are the components of $T_r \setminus \{v\}$ and $T_r \setminus \{u\}$
maximizing $|P(C_v)|$ and $|P(C_u)|$ respectively, then $|P(C_u)| \le 2|P(C_v)|$. Hence we know that
for every component $Q$ of $T \setminus S$, $|P(Q)| \le 6s + 14(b+1) \le 20s$. This bound holds both for
the components included in the collection $Q_1, \ldots, Q_{|S|}$ and the ones that are not.

Having constructed $S$ and $Q_1, \ldots, Q_{|S|}$ we let $S' = \text{LCA-closure}(S)$. By Lemma 1
we have $|S'| \le 2|S|$. Let $S^* = S' \setminus S$. Since $|S^*| \le |S|$, at most $\frac{|S|}{2}$ of the components
$Q_1, \ldots, Q_{|S|}$ contain at least two vertices of $S^*$. This implies that at least $\frac{|S|}{2}$ of
the components $Q_1, \ldots, Q_{|S|}$ contain at most one vertex of $S^*$. Without loss of generality, let
$Q_1, \ldots, Q_{|S|/2}$ contain at most one vertex of $S^*$ each. For every $i \le |S|/2$, if $Q_i$ contains no
vertex of $S^*$ then $Q'_i = Q_i$ is a component of $T \setminus S'$ with $|P(Q'_i)| \ge 3s + 7(b+1) \ge s + 2(b+1)$.
If $Q_i$ contains one vertex $v$ of $S^*$, since $v$ has degree at most 3 and $|P(Q_i)| \ge 3s + 6$, $Q_i \setminus \{v\}$
has at least one component $Q'_i$ with $|P(Q'_i)| \ge s + 2(b+1)$. Thus we have constructed a set
$S'$ and a collection of components $Q'_1, \ldots, Q'_{|S|/2}$ of $T \setminus S'$ of size at least $s + 2(b+1)$. By
Lemma 1 every $Q'_i$ has at most two neighbors in $T$.

We make a collection $\mathcal{Z}$ as follows. For every $i \le |S|/2$ let $Z_i = P(Q'_i) \setminus P(S')$. Since
$Q'_i$ has at most two neighbors in $T$ it follows that $N[Z_i]$ is a $2(b+1)$-protrusion and that
$|Z_i| \ge s + 2(b+1) - 2(b+1) = s$. We have already shown that $|P(Q'_i)| \le 20s$ so $|Z_i| \le 20s$ as
well. Hence $\mathcal{Z}$ is in fact an $(s, 6s, 2(b+1))$-protrusion cover of $G$. It remains to lower bound
$|\mathcal{Z}|$. We have that $|\mathcal{Z}| = |S|/2$. Furthermore we have that $S$, together with the connected
components of $T \setminus S$, covers $T$. Since every bag has size at most $(b+1) \le s$, $T \setminus S$ has at most
$2|S| + 1 \le 3|S|$ connected components, and for every component $Q$ of $T \setminus S$, $|P(Q)| \le 20s$, we
have that $|S|(b+1) + 3|S| \cdot 20s \ge n$. Since $s \ge b+1$ this implies that $|S| \ge \frac{n}{61s}$. □

Lemma 11. If $G$ has an $(\alpha, \beta)$-protrusion decomposition, then for every $s > \beta$, $G$ has an
$(s, 6s, 3(\beta+1))$-protrusion cover of size at least $\frac{n}{122s} - \alpha$.

Proof. Let $R_0, \ldots, R_t$ be an $(\alpha, \beta)$-protrusion decomposition of $G$. At most $\alpha$ vertices are in
$R_0$, and at most $\alpha \cdot s$ vertices are in sets $R_i$ for $i \ge 1$ such that $|R_i| < s$. For each $i \ge 1$
such that $|R_i| \ge s$ we apply Lemma 10 and obtain an $(s, 6s, 2(\beta+1))$-protrusion cover $\mathcal{Z}_i$ in
$G[R_i]$. We let $\mathcal{Z}$ be the union of all the $\mathcal{Z}_i$'s constructed in this manner. For every $Z \in \mathcal{Z}_i$,
$N_{G[R_i]}[Z]$ is a $2(\beta+1)$-protrusion in $G[R_i]$. However $Z$ might have neighbors also in $R_0$.
The number of neighbors of $Z$ in $R_0$ is at most $\beta$ and hence $N[Z]$ is a $3(\beta+1)$-protrusion in
$G$. We conclude that $\mathcal{Z}$ is an $(s, 6s, 3(\beta+1))$-protrusion cover in $G$. The size of $\mathcal{Z}$ is at least
\[ \frac{n - \alpha - \alpha \cdot s}{122s} \ge \frac{n}{122s} - \alpha. \] □

The Fast Protrusion Replacer Theorems We are now ready to prove our main results 
on Fast Protrusion Replacement. 

Theorem 5 (Randomized Fast Protrusion Replacer Theorem). Let $\Pi$ be a problem that
has a protrusion replacer that replaces $r$-protrusions of size at least $r'$, and let $s$ and $\beta$ be
constants such that $r \ge 3(\beta+1)$ and $s \ge 2^r \cdot r'$. Given an instance $(G,k)$ as input, the RFPR
will run in time $O(n+m)$ and produce an equivalent instance $(G',k')$ with $|V(G')| \le |V(G)|$
and $k' \le k$. If additionally $G$ has an $(\alpha, \beta)$-protrusion decomposition such that $\alpha \le \frac{n}{244s}$, then
with probability at least $1 - e^{-\frac{n}{2000 s (r+1)^{6s}}}$ we have $|V(G)| - |V(G')| \ge \frac{n}{1000 s (r+1)^{6s}}$.

Proof. The first part of the statement follows directly from Lemma 6. If $G$ has an $(\alpha, \beta)$-protrusion
decomposition such that $\alpha \le \frac{n}{244s}$ then by Lemma 11, $G$ has an $(s, 6s, 3(\beta+1))$-protrusion
cover $\mathcal{X}$ of size at least $\frac{n}{122s} - \alpha \ge \frac{n}{244s}$. Plugging $\mathcal{X}$ into Lemma 7 yields that with
probability at least $1 - e^{-\frac{|\mathcal{X}|}{8(r+1)^{6s}}} \ge 1 - e^{-\frac{n}{2000 s (r+1)^{6s}}}$ we have $|V(G)| - |V(G')| \ge \frac{|\mathcal{X}|}{4(r+1)^{6s}} \ge
\frac{n}{1000 s (r+1)^{6s}}$. □

Theorem 6 (Deterministic Fast Protrusion Replacer Theorem). Let $\Pi$ be a problem that
has a protrusion replacer that replaces $r$-protrusions of size at least $r'$, and let $s$ and $\beta$ be
constants such that $r \ge 3(\beta+1)$ and $s \ge 2^r \cdot r'$. Given an instance $(G,k)$ as input, the
DFPR will run in time $O(2^{20s} \cdot (n+m) \log n)$ and produce an equivalent instance $(G',k')$
with $|V(G')| \le |V(G)|$ and $k' \le k$. If additionally $G$ has an $(\alpha, \beta)$-protrusion decomposition
such that $\alpha \le \frac{n}{244s}$ then we have $|V(G)| - |V(G')| \ge \frac{n}{244 s \cdot 2^{20s} \log n}$.

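Proof. The first part of the statement follows directly from Lemma 8. If $G$ has an $(\alpha, \beta)$-protrusion
decomposition such that $\alpha \le \frac{n}{244s}$ then by Lemma 11, $G$ has an $(s, 6s, 3(\beta+1))$-protrusion
cover $\mathcal{X}$ of size at least $\frac{n}{122s} - \alpha \ge \frac{n}{244s}$. Plugging $\mathcal{X}$ into Lemma 9 yields that
$|V(G)| - |V(G')| \ge \frac{|\mathcal{X}|}{2^{20s} \log n} \ge \frac{n}{244 s \cdot 2^{20s} \log n}$. □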

It can be shown that Theorem 5 could replace the simple protrusion reduction algorithm
of Lemma 2 and thus make Algorithm 2 run in linear time. However, we are first going to refine
Algorithm 2 even further so that it becomes simultaneously a single exponential parameterized
algorithm and an approximation algorithm for $p$-$\mathcal{F}$-Deletion for all connected $\mathcal{F} \in \mathscr{L}$. To
that end we develop the notion of lossless protrusion replacement.

5 Lossless Protrusion Replacement



In this section we develop the notion of lossless protrusion replacement. We consider CMSO
vertex subset problems. In a min-CMSO vertex subset problem $\Pi$ we are given a graph
$G$ as input. The objective is to find a set $S \subseteq V(G)$ minimizing $|S|$ such that
the CMSO-expressible predicate $P_\Pi(G,S)$ is satisfied. Similarly, in a max-CMSO vertex
subset problem $\Pi$ we are given a graph $G$ as input. The objective is to find a set $S \subseteq V(G)$
maximizing $|S|$ such that the CMSO-expressible predicate $P_\Pi(G,S)$ is satisfied. Given a
min-CMSO (max-CMSO) vertex subset problem $\Pi$ and an input graph $G$ to $\Pi$, by $OPT(G)$
we denote the size of the smallest (largest) set $S$ such that the CMSO-expressible predicate
$P_\Pi(G,S)$ is satisfied. Next we define the notion of a lossless protrusion replacer. A lossless
protrusion replacer is essentially a protrusion replacer that reduces protrusions in such a way
that any feasible solution to the reduced instance can be changed into a feasible solution of the
original instance without changing the gap between the feasible solution and the optimum.
The notion of lossless protrusion replacement is central in our approximation algorithms.

Definition 3 (Lossless Protrusion Replacer). A lossless protrusion replacer for a min-CMSO
(max-CMSO) vertex subset problem $\Pi$ is a family of algorithms, with one algorithm for
every constant $r$. The $r$-th algorithm has the following specifications. There exists a constant
$r'$ (which depends on $r$) such that given an instance $G$ and an $r$-protrusion $X$ in $G$ of size
at least $r'$, the algorithm runs in time $O(|X|)$ and outputs an instance $G'$ with the following
properties.

• $G'$ is obtained from $G$ by replacing $X$ by an $r$-boundaried graph $X'$ with less than $r'$
vertices and thus $|V(G')| < |V(G)|$.

• $OPT(G') \le OPT(G)$.

• There is an algorithm that runs in $O(|X|)$ time and given a feasible solution $S'$ to $G'$
outputs a set $X^* \subseteq X$ such that $S = (S' \setminus X') \cup X^*$ is a feasible solution to $G$ and
$|S| \le |S'| + OPT(G) - OPT(G')$.

We would like to give sufficient conditions for a problem to have a lossless protrusion
replacer. An ideal setting would be that every graph optimization problem that has finite
integer index when parameterized by the size of the optimal solution has a lossless protrusion
replacer. Unfortunately such a theorem seems to be out of reach, and it is quite possible
that this is not true. However, in [8] a sufficient condition is given for a CMSO vertex
subset problem to have finite integer index. This condition is called strong monotonicity and
it is proved that every CMSO vertex subset problem that is strongly monotone has finite
integer index and hence has a protrusion replacer. It turns out that strong monotonicity
is a sufficient condition for a CMSO vertex subset problem to not only have a protrusion
replacer, but also a lossless protrusion replacer. We now prove this fact.

Let $\Pi$ be a min-CMSO problem and $\mathcal{F}_t$ be the set of pairs $(G,S)$ where $G$ is a $t$-boundaried
graph and $S \subseteq V(G)$. For a $t$-boundaried graph $G$ we define the function
$\zeta_G : \mathcal{F}_t \to \mathbb{N} \cup \{\infty\}$ as follows. For a pair $(G',S') \in \mathcal{F}_t$, if there is no set $S \subseteq V(G)$ such that
$P_\Pi(G \oplus G', S \cup S')$ holds, then $\zeta_G((G',S')) = \infty$. Otherwise $\zeta_G((G',S'))$ is the size of the
smallest $S \subseteq V(G)$ such that $P_\Pi(G \oplus G', S \cup S')$ holds. If $\Pi$ is a max-CMSO problem then
we define $\zeta_G((G',S'))$ to be the size of the largest $S \subseteq V(G)$ such that $P_\Pi(G \oplus G', S \cup S')$
holds. If there is no set $S \subseteq V(G)$ such that $P_\Pi(G \oplus G', S \cup S')$ holds, then $\zeta_G((G',S')) = \infty$.

Definition 3 ([8]). A min-CMSO problem $\Pi$ is said to be strongly monotone if there exists
a function $f : \mathbb{N} \to \mathbb{N}$ such that the following condition is satisfied. For every $t$-boundaried
graph $G$, there is a subset $S \subseteq V(G)$ such that for every $(G',S') \in \mathcal{F}_t$ such that $\zeta_G((G',S'))$
is finite, $P_\Pi(G \oplus G', S \cup S')$ holds and $|S| \le \zeta_G((G',S')) + f(t)$.

Definition 4 ([8]). A max-CMSO problem $\Pi$ is said to be strongly monotone if there exists
a function $f : \mathbb{N} \to \mathbb{N}$ such that the following condition is satisfied. For every $t$-boundaried
graph $G$, there is a subset $S \subseteq V(G)$ such that for every $(G',S') \in \mathcal{F}_t$ such that $\zeta_G((G',S'))$
is finite, $P_\Pi(G \oplus G', S \cup S')$ holds and $|S| \ge \zeta_G((G',S')) - f(t)$.

Theorem 7. Every min-CMSO or max-CMSO vertex subset problem $\Pi$ that is also
strongly monotone admits a lossless protrusion replacer.

Before proving the theorem we will need an auxiliary lemma. 

Lemma 12. If a graph $G$ contains an $r$-protrusion $X$ where $|X| > c > 0$, then it also
contains a $(2r+1)$-protrusion $Y$ where $c \le |Y| \le 2c$. Moreover, given $X$ we can compute $Y$
and a tree decomposition of $Y$ of width $\le 2r$ in $O(|X|)$ time.

Proof. Let $(T, \mathcal{X})$ be a nice tree decomposition of $G[X]$ rooted at a node $r$. We can compute
$(T, \mathcal{X})$ from $G[X]$ in time $O(|X|)$ using Bodlaender's algorithm [6]. If $|X| \le 2c$, we are done.
Given a vertex $x$ of the rooted tree $T$, we denote by $D_T(x)$ the subset of $V(T)$ containing $x$
and all its descendants in $T$. Let $B_T$ be the set containing each vertex $x$ of $T$ with the property
that the vertices appearing in $\bigcup_{y \in D_T(x)} X_y$ (i.e. the vertices of the bags corresponding to
$x$ and its descendants) number more than $c$. As $|X| > 2c$, $B_T$ is a non-empty set. We choose
$b$ to be a member of $B_T$ none of whose descendants belong to $B_T$. This choice of $b$ ensures
that $c < |\bigcup_{y \in D_T(b)} X_y| \le 2c$. We define $Y = \partial_G(X) \cup \bigcup_{y \in D_T(b)} X_y$. As $G[Y]$ is an induced
subgraph of $G[X]$ it follows that $\mathrm{tw}(G[Y]) \le r$. Furthermore $\partial_G(Y) \subseteq \partial_G(X) \cup X_b$, therefore $Y$
is a $(2r+1)$-protrusion of $G$. □
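
The choice of the node $b$ in the proof above can be sketched in code. The following Python sketch is illustrative only: the input encoding (`children`, `bags`), the name `lowest_heavy_node`, and the explicit per-node vertex sets are assumptions of this example, and the set unions make it slower than the linear-time procedure the lemma claims.

```python
from typing import Dict, Hashable, List, Optional, Set, Tuple

Node = Hashable


def lowest_heavy_node(
    children: Dict[Node, List[Node]],
    bags: Dict[Node, Set[Hashable]],
    root: Node,
    c: int,
) -> Optional[Tuple[Node, Set[Hashable]]]:
    """Return a node b whose subtree bags cover more than c vertices while
    no child subtree does, together with the covered vertex set.
    Returns None when no subtree covers more than c vertices."""
    # Pre-order traversal: every node appears before its descendants.
    order: List[Node] = []
    stack = [root]
    while stack:
        x = stack.pop()
        order.append(x)
        stack.extend(children.get(x, []))

    # Process in reverse so children are handled before their parent.
    covered: Dict[Node, Set[Hashable]] = {}
    for x in reversed(order):
        union = set(bags[x])
        for y in children.get(x, []):
            union |= covered[y]
        covered[x] = union

    # Heaviness is inherited upwards, so checking children suffices.
    for x in reversed(order):
        heavy = len(covered[x]) > c
        child_heavy = any(len(covered[y]) > c for y in children.get(x, []))
        if heavy and not child_heavy:
            return x, covered[x]
    return None
```

Because a parent's covered set contains each child's covered set, a heavy node with no heavy child has no heavy descendant at all, which is the property the proof uses.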

Proof of Theorem 7. We prove the theorem for min-CMSO problems; the proof for max-CMSO
problems is similar. Let $\Pi$ be a strongly monotone min-CMSO problem. We define a partial
order $\le_\Pi$ on pairs $(G, S)$ such that $G$ is a $t$-boundaried graph and $S \subseteq V(G)$. We say that
$(G, S) \le_\Pi (G', S')$ if for every $(G_3, S_3)$, $P_\Pi(G \oplus G_3, S \cup S_3) \to P_\Pi(G' \oplus G_3, S' \cup S_3)$. We say
that $(G, S) \equiv_\Pi (G', S')$ if $(G, S) \le_\Pi (G', S')$ and $(G', S') \le_\Pi (G, S)$. Clearly $\equiv_\Pi$ is an
equivalence relation and since $P_\Pi$ is a CMSO-expressible predicate it follows from [10, 16] that
for every fixed $t$, $\equiv_\Pi$ has finitely many equivalence classes. Thus there exists a finite set $\mathcal{S}$ of
pairs $(G_R, S_R)$ such that for every $(G, S)$ there is a $(G_R, S_R) \in \mathcal{S}$ such that $(G, S) \equiv_\Pi (G_R, S_R)$.
We say that a pair $(G, S)$ is bad if there is no $(G', S')$ such that $P_\Pi(G \oplus G', S \cup S')$ holds. A
pair that is not bad is called useful. Let $\mathcal{U}$ be the set of all useful pairs in $\mathcal{S}$.

For a graph $G$ and pair $(G_R, S_R) \in \mathcal{U}$ define $\gamma_G(G_R, S_R)$ to be the size of the smallest
set $S \subseteq V(G)$ such that $(G_R, S_R) \le_\Pi (G, S)$. If no such set $S$ exists, $\gamma_G(G_R, S_R) = \infty$. We
now prove that for any $G$, the maximum finite value of $\gamma_G$ and the minimum (finite) value
of $\gamma_G$ differ by at most $f(t)$. Let $S \subseteq V(G)$ be the set such that for every $(G', S') \in \mathcal{F}_t$ such
that $\zeta_G((G', S'))$ is finite, $P_\Pi(G \oplus G', S \cup S')$ holds and $|S| \le \zeta_G((G', S')) + f(t)$. Consider a
useful pair $(G_R, S_R) \in \mathcal{U}$ such that $\gamma_G(G_R, S_R)$ is finite. Then there exists a set $S' \subseteq V(G)$
of size $\gamma_G(G_R, S_R)$ such that $(G_R, S_R) \le_\Pi (G, S')$. Since $(G, S') \le_\Pi (G, S)$ and $S'$ is the
smallest set such that $(G_R, S_R) \le_\Pi (G, S')$ it follows that $|S'| \le |S|$. On the other hand
since $(G_R, S_R)$ is useful there exists some $(G^*, S^*)$ such that $P_\Pi(G_R \oplus G^*, S_R \cup S^*)$ holds.
Then $P_\Pi(G \oplus G^*, S' \cup S^*)$ holds as well and hence $\zeta_G((G^*, S^*)) \le |S'|$. Since $\zeta_G((G^*, S^*))$ is
finite it follows that $|S| \le \zeta_G((G^*, S^*)) + f(t) \le |S'| + f(t)$. But this means that $|S| - f(t) \le
\gamma_G(G_R, S_R) \le |S|$ and so the finite values of $\gamma_G$ differ by at most $f(t)$. By the pigeonhole
principle there exists a finite collection $\mathcal{R}$ of $t$-boundaried graphs such that for any
$t$-boundaried $G$ there is a $G_R \in \mathcal{R}$ and a constant $c_R \ge 0$ such that for every useful pair
$(G', S')$, $\gamma_G(G', S') = \gamma_{G_R}(G', S') + c_R$. We call $\mathcal{R}$ a set of representatives for $(\Pi, t)$.

For every integer $c$ we define a relation $\prec_c$ on $t$-boundaried graphs. We say that $G_1 \prec_c G_2$
if for every useful pair $(G, S)$, $\gamma_{G_1}(G, S) + c = \gamma_{G_2}(G, S)$. Observe that if $G_1 \prec_c G_2$ then
$G_2 \prec_{-c} G_1$. Also, we have just shown that for every $G$ there is a $G_R \in \mathcal{R}$ and constant
$c_R \ge 0$ such that $G_R \prec_{c_R} G$. We now show that if $G \prec_c G'$ then for any $t$-boundaried graph
$G_3$ and feasible solution $S$ to $\Pi$ on $G \oplus G_3$, there is a set $X^* \subseteq V(G')$ depending only on
$S \cap V(G)$ and $G$ such that $S' = X^* \cup (S \setminus V(G))$ is also a feasible solution to $\Pi$ on $G' \oplus G_3$
and $|S'| \le |S| + c$.

Let $G \prec_c G'$ and consider a $t$-boundaried $G_3$ and a feasible solution $S$ of $\Pi$ on $G \oplus G_3$. Let
$S_G = S \cap V(G)$ and $S_3 = S \setminus S_G$. $(G, S_G)$ is a useful pair and so there is a pair $(G_R, S_R) \in \mathcal{U}$
such that $(G_R, S_R) \equiv_\Pi (G, S_G)$. Thus $\gamma_G(G_R, S_R) \le |S_G|$ and hence $\gamma_{G'}(G_R, S_R) \le |S_G| + c$.
There is a set $X^* \subseteq V(G')$ such that $(G_R, S_R) \le_\Pi (G', X^*)$ and $|X^*| \le |S_G| + c$. The set $X^*$
depends solely on $(G_R, S_R)$ which depends solely on $S \cap V(G)$ and $G$. Furthermore, since
$(G_R, S_R) \le_\Pi (G', X^*)$ we have that $S' = X^* \cup S_3$ is also a feasible solution to $\Pi$ on
$G' \oplus G_3$ and $|S'| \le |S_G| + c + |S_3| \le |S| + c$.

We can now describe the lossless protrusion replacer for the problem $\Pi$. For parameter
$r$ consider the set $\mathcal{R}$ of representatives for $(\Pi, 2(r+1))$. Let $r'$ be the size of the largest
graph in $\mathcal{R}$ plus one. The lossless protrusion replacer for $\Pi$ will reduce $r$-protrusions of size
at least $r'$. Given an $r$-protrusion $X$ of size at least $r'$ we find a $2(r+1)$-protrusion $Y \subseteq X$
such that $r' \le |Y| \le 2r'$. This can be done in $O(|X|)$ time by Lemma 12. Consider now
the $2(r+1)$-boundaried graph $G_Y^{\partial(Y)}$. There exists a $2(r+1)$-boundaried graph $G_R \in \mathcal{R}$
and constant $c_R \ge 0$ such that $G_R \prec_{c_R} G_Y^{\partial(Y)}$. Furthermore since $|Y| \ge r'$ we have that
$|V(G_R)| < |Y|$. The protrusion replacer outputs the graph $G'$ obtained by replacing $Y$ by
$G_R$ in $G$.

For every subset $S_R \subseteq V(G_R)$ such that the pair $(G_R, S_R)$ is useful, the protrusion
replacer stores a subset $S_Y \subseteq Y$ such that $(G_R, S_R) \le_\Pi (G_Y^{\partial(Y)}, S_Y)$. Since $G_R \prec_{c_R} G_Y^{\partial(Y)}$
there is such a set $S_Y$ of size at most $|S_R| + c_R$. Now, for any feasible solution $S$ in $G'$ let
$S_R = S \cap V(G_R)$. The pair $(G_R, S_R)$ is useful and so the lossless protrusion replacer outputs
the set $X^* = S_Y$ which it has stored for $S_R$. Now $S' = S_Y \cup (S \setminus V(G_R))$ is a feasible solution
to $G$ because $(G_R, S_R) \le_\Pi (G_Y^{\partial(Y)}, S_Y)$. Furthermore, since $|S_Y| \le |S_R| + c_R$ we have that
$|S'| \le |S_Y| + |S \setminus V(G_R)| \le |S_R| + c_R + |S \setminus V(G_R)| \le |S| + c_R$. Thus it remains to prove that
$c_R \le OPT(G) - OPT(G')$, or in other words that $OPT(G') \le OPT(G) - c_R$.

However $G_Y^{\partial(Y)} \prec_{-c_R} G_R$, and hence for an optimal solution $S$ of $G = G_Y^{\partial(Y)} \oplus
G_{V(G) \setminus Y}^{\partial(Y)}$ there is a feasible solution $S'$ in $G_R \oplus G_{V(G) \setminus Y}^{\partial(Y)}$ of size at most $|S| - c_R$.
Hence $OPT(G') \le OPT(G) - c_R$ and the theorem follows. □

Inserting a lossless protrusion replacer instead of a normal protrusion replacer into the 
Fast Protrusion Replacer algorithms directly yields the following theorems. 

Theorem 8. Let $\Pi$ be a minimization (maximization) problem that has a lossless protrusion
replacer that replaces $r$-protrusions of size at least $r'$, and let $s$ and $\beta$ be constants such that
$r \ge 3(\beta + 1)$ and $s \ge 2^r \cdot r'$. Given an instance $G$ as input, the Randomized Fast Protrusion
Replacer will run in time $O(n + m)$ and produce an instance $G'$ with $|V(G')| \le |V(G)|$.
Given any feasible solution $S'$ to $G'$ a feasible solution $S$ of $G$ of size at most (at least)
\S'\ —OPT(G') + OPT(G) can be computed in 0(n + m) time. If additionally G has a (a,f3)- 

n 

protrusion decomposition such that a < 2Ws' ^ en with probability at least 1 — e 2000s ( r+1 ' 6s 
we have \V(G)\ - \V(G)\ > 1000(r " +1)6s ■ 

Theorem 9. Let $\Pi$ be a minimization (maximization) problem that has a lossless protrusion
replacer that replaces $r$-protrusions of size at least $r'$, and let $s$ and $\beta$ be constants such
that $r \ge 3(\beta + 1)$ and $s \ge 2^r \cdot r'$. Given an instance $G$ as input, the Deterministic Fast
Protrusion Replacer will run in time $O(2^{20s} \cdot (n + m)\log n)$ and produce an instance $G'$
with $|V(G')| \le |V(G)|$. Given any feasible solution $S'$ to $G'$ a feasible solution $S$ of $G$
of size at most (at least) $|S'| - OPT(G') + OPT(G)$ can be computed in $O(n + m)$ time.
If additionally G has a (a, (5) -protrusion decomposition such that a < then we have 
\V(G)\-\V(G')\> 2M . 2 Z logn - 

6 Approximation and Fast Parameterized Algorithm for $p$-$\mathcal{F}$-Deletion

We are now ready to give the linear time, lossless variant of Lemma 2. Throughout this
section $OPT(G)$ is the size of the smallest $\mathcal{F}$-deletion set of $G$, for the set $\mathcal{F}$ currently under
consideration. First we give an auxiliary lemma analyzing an execution of the Lossless RFPR
on a graph with an $\mathcal{F}$-deletion set $S$.

Lemma 13. For every connected $\mathcal{F} \in \mathscr{L}$, there exist constants $\rho, r, s, c < 1$ and $\gamma > 0$ such
that if we run the Lossless RFPR with parameters $r, s$ on a graph $G$ which has an $\mathcal{F}$-deletion
set $S$ which is not a $\rho$-cover, then with probability at least $1 - e^{-\gamma n}$ the output instance $G'$
satisfies $|V(G')| \le |V(G)|(1 - c)$.

Proof. If $G$ has an $\mathcal{F}$-deletion set $S'$ which is not a $\rho$-cover, it also has an inclusion minimal
$\mathcal{F}$-deletion set $S$ which is not a $\rho$-cover. Such a minimal $S$ contains no isolated vertices and
hence satisfies $|N[S]| \le 2\sum_{v \in S} d(v) \le 2\rho m$.

By Proposition 1 there exists a constant $b$ such that $\mathrm{tw}(G \setminus S) \le b$. By Lemma 5, $G$ has
a $(4(b+1)|N[S]|, 2(b+1))$-protrusion decomposition. Set $\beta = 2(b+1)$, $r = 3(\beta + 1)$ and $r'$
to be the smallest integer such that the lossless protrusion replacer will replace $r$-protrusions
of size at least $r'$. Set $s = 2^r \cdot r'$. The protrusion decomposition of $G$ is a $(4(b+1)|N[S]|, \beta)$-protrusion
decomposition. By Theorem 8 there exist constants $0 < c < 1$ and $0 < \gamma$ such
that if we run the Lossless RFPR on $G$ and $4(b+1)|N[S]| \le \frac{n}{250s}$ then with probability at
least $1 - e^{-\gamma n}$, the output graph $G'$ satisfies $|V(G)| - |V(G')| \ge c|V(G)|$. We show that there
is a constant $\rho \le \frac{1}{3}$ such that if $S$ is not a $\rho$-cover, then $|N[S]| \le \frac{n}{1000(b+1)s}$.

Since $\mathrm{tw}(G \setminus S) \le b$ we have that $G \setminus S$ is $(b+1)$-degenerate. If $S$ is not a $\rho$-cover
then $m \le n(b+1) + \sum_{v \in S} d(v) \le n(b+1) + 2\rho m$. Rearranging yields that $|N[S]| \le 2\rho m \le
n\frac{2\rho(b+1)}{1 - 2\rho} \le n\rho 6(b+1)$. Choosing $\rho = \frac{1}{6000(b+1)^2 s}$ yields that $|N[S]| \le \frac{n}{1000(b+1)s}$. Hence,
if $S$ is not a $\rho$-cover then with probability at least $1 - e^{-\gamma n}$ the output instance $G'$ of the
Lossless RFPR satisfies $|V(G')| \le |V(G)|(1 - c)$. □
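
The rearrangement at the end of the proof can be written out step by step. This is a worked restatement under the proof's own hypothesis that $\rho \le 1/3$, so that $1 - 2\rho \ge 1/3$; the final implication, which multiplies through by $4(b+1)$, is an added arithmetic consequence of the stated choice of $\rho$ rather than a bound quoted from the theorem.

```latex
\begin{align*}
m \le n(b+1) + 2\rho m
  &\;\Longrightarrow\; m \le \frac{n(b+1)}{1-2\rho} \le 3n(b+1), \\
|N[S]| \le 2\rho m
  &\;\Longrightarrow\; |N[S]| \le 6\rho\, n(b+1), \\
\rho = \frac{1}{6000(b+1)^2 s}
  &\;\Longrightarrow\; |N[S]| \le \frac{n}{1000(b+1)s}
  \;\Longrightarrow\; 4(b+1)\,|N[S]| \le \frac{n}{250\,s}.
\end{align*}
```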

Lemma 14. For every connected $\mathcal{F} \in \mathscr{L}$ there is an algorithm that given a graph $G$, takes
$O(n + m)$ time and outputs a graph $G'$ such that $|V(G')| \le |V(G)|$ and $OPT(G') \le OPT(G)$.
Given an $\mathcal{F}$-deletion set $S'$ of $G'$ the algorithm can compute an $\mathcal{F}$-deletion set $S$ of $G$ of size
$|S'| + OPT(G) - OPT(G')$ in time $O(n + m)$. Furthermore there exists a constant $0 < \rho < 1$
such that with probability at least $\frac{1}{2}$, every $\mathcal{F}$-deletion set $S'$ of $G'$ is a $\rho$-cover of $G'$.



Proof. By Lemma 13 there exist constants $\rho, r, s, c < 1$ and $\gamma > 0$ such that if we run
the Lossless RFPR with parameters $r, s$ on a graph $G$ which has an $\mathcal{F}$-deletion set $S$ which
is not a $\rho$-cover, then with probability at least $1 - e^{-\gamma n}$ the output instance $G'$ satisfies
$|V(G')| \le |V(G)|(1 - c)$. We set these constants as guaranteed by Lemma 13.

The algorithm sets $G_1 := G$, $i = 1$ and enters a loop that proceeds as follows. The
algorithm runs the Lossless RFPR on $G_i$ with parameters $r$ and $s$; let the output of the
Lossless RFPR be $G_{i+1}$. If $|V(G_{i+1})| > |V(G_i)|(1 - c)$ the algorithm halts and outputs $G_i$.
Otherwise, the algorithm increments $i$ and returns to the beginning of the loop.

The total time spent by the algorithm is upper bounded by a geometric series, and so
the running time of the algorithm is $O(n + m)$. Similarly, by repeatedly applying Theorem 8
we can in linear time transform any $\mathcal{F}$-deletion set $S'$ of $G'$ back into an $\mathcal{F}$-deletion set $S$ of
$G$ of size at most $|S'| + OPT(G) - OPT(G')$. It remains to prove that when the algorithm
terminates, with probability at least $\frac{1}{2}$ we have that every $\mathcal{F}$-deletion set $S'$ of $G'$ is a $\rho$-cover
of $G'$.

The algorithm makes $t = O(\log n)$ calls to the Lossless RFPR. For $i \le t + 1$ let $n_i =
|V(G_i)|$. In call $i$, by Lemma 13, if $G_i$ has an $\mathcal{F}$-deletion set $S$ which is not a $\rho$-cover
then the probability that $|V(G_{i+1})| > |V(G_i)|(1 - c)$ is at most $e^{-\gamma n_i}$. By the union bound
the probability that this occurs at some step $i$ is at most $\sum_{i \le t} e^{-\gamma n_i}$. The $n_i$'s are a decreasing
geometric series and so for a sufficiently large (constant) $N$ we have that if $n_t \ge N$ then

$$\sum_{i \le t} e^{-\gamma n_i} \le 2e^{-\gamma n_t} \le \frac{1}{2}.$$

Finally, if $n_t < N$ then any non-empty set $S$ is a $\frac{1}{N^2}$-cover, and so if $\rho > \frac{1}{N^2}$ we can
adjust $\rho$ to $\frac{1}{N^2}$. This proves the lemma. □
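
The halting rule of the loop in this proof can be sketched as follows. This is a minimal illustration, not the Lossless RFPR itself: `reduce_once` and `size` are hypothetical callables standing in for one call to the replacer and for $|V(\cdot)|$, and the guard for empty graphs is an addition of this sketch.

```python
from typing import Callable, List, Tuple, TypeVar

G = TypeVar("G")


def shrink_until_stuck(
    graph: G,
    reduce_once: Callable[[G], G],  # hypothetical: one run of the replacer
    size: Callable[[G], int],       # hypothetical: number of vertices
    c: float,
) -> Tuple[G, List[int]]:
    """Repeat the reduction while each call removes at least a c-fraction
    of the vertices; return the last accepted graph and the size history."""
    current = graph
    sizes = [size(current)]
    while True:
        candidate = reduce_once(current)
        # Halt as soon as a call shrinks the graph by less than a factor (1 - c).
        # The size == 0 test only prevents looping forever on an empty graph.
        if size(current) == 0 or size(candidate) > size(current) * (1 - c):
            return current, sizes
        current = candidate
        sizes.append(size(current))
```

Since every accepted step multiplies the size by at most $1 - c$, the recorded sizes sum to at most $n/c$, which is the geometric-series bound behind the linear running time.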

We are now ready to give the algorithm which is the main engine behind both our $2^{O(k)} n$
time algorithm and the quadratic approximation algorithm for $\mathcal{F}$-Deletion for connected
sets $\mathcal{F} \in \mathscr{L}$.



Randomized-$\mathcal{F}$-Deletion($G$)

Set $G_1 := G$ and $i := 1$.

While ($G_i$ is not $\mathcal{F}$-free) do as follows:

1. Apply Lemma 14 on $G_i$ and obtain a new graph $G'_i$.

2. Pick a vertex $u_i \in V(G'_i)$ at random with probability $\frac{d_{G'_i}(u_i)}{2|E(G'_i)|}$. Set $G_{i+1} := G'_i \setminus u_i$.

3. Increment $i$ by 1.

Set $S_i := \emptyset$.

For $j = i$ downto 2:

1. Set $S'_{j-1} := S_j \cup \{u_{j-1}\}$.

2. Apply Lemma 14 on $G_{j-1}$ and $S'_{j-1}$ and obtain a set $S_{j-1}$.

Output $S := S_1$.

Figure 3: Randomized Algorithm for $\mathcal{F}$-Deletion for connected $\mathcal{F} \in \mathscr{L}$
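
The random choice in step 2 of the while-loop can be realised by picking a uniformly random edge and then one of its two endpoints. The sketch below is illustrative: the function names are hypothetical, the graph is assumed to be a non-empty list of edges without self-loops, and `hit_probability` assumes that a $\rho$-cover $S$ is a set whose degree sum is at least a $\rho$ fraction of $2|E|$.

```python
import random
from collections import Counter
from fractions import Fraction
from typing import Dict, Hashable, Iterable, List, Tuple

Edge = Tuple[Hashable, Hashable]


def pick_probabilities(edges: List[Edge]) -> Dict[Hashable, Fraction]:
    """Exact probability d(u) / (2|E|) with which each vertex is picked."""
    degree: Counter = Counter()
    for u, v in edges:
        degree[u] += 1
        degree[v] += 1
    return {v: Fraction(d, 2 * len(edges)) for v, d in degree.items()}


def pick_vertex(edges: List[Edge], rng: random.Random) -> Hashable:
    """Sample a vertex with probability proportional to its degree:
    choose a uniform edge, then a uniform endpoint of that edge."""
    u, v = rng.choice(edges)
    return u if rng.random() < 0.5 else v


def hit_probability(edges: List[Edge], solution: Iterable[Hashable]) -> Fraction:
    """Probability that the sampled vertex lies in `solution`."""
    probs = pick_probabilities(edges)
    return sum((probs.get(v, Fraction(0)) for v in solution), Fraction(0))
```

Under the stated assumption on $\rho$-covers, `hit_probability(edges, S)` is at least $\rho$ whenever $S$ is a $\rho$-cover, which is the quantity the success analysis of a round relies on.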



We say that a round of Algorithm 3 is an iteration of the while-loop. Round $x$ is the
iteration when the value of $i$ is $x$. The algorithm succeeds in round $i$ if $OPT(G'_i) = OPT(G_{i+1}) + 1$
and it fails in round $i$ otherwise. The number of rounds of a run of Algorithm 3 is the maximum
value $i$ takes. We make a series of observations about Algorithm 3. For every $i$ we have
that $|V(G'_i)| \le |V(G_i)|$ and $|V(G_{i+1})| < |V(G'_i)|$. Hence we make the following observation.

Observation 1. Algorithm 3 terminates after at most $n$ rounds.

The next observation follows directly from Lemma 14.

Observation 2. The time taken in each round and each iteration of the for loop is $O(n + m)$.

Next we prove that the algorithm always outputs feasible solutions.

Observation 3. Algorithm 3 outputs an $\mathcal{F}$-deletion set of $G$.

Proof. Let $t$ be the number of rounds. We have that $G_t$ is $\mathcal{F}$-free and so $S_t = \emptyset$ is an $\mathcal{F}$-deletion
set of $G_t$. If $S_j$ is an $\mathcal{F}$-deletion set of $G_j$ then $S'_{j-1} = S_j \cup \{u_{j-1}\}$ is an $\mathcal{F}$-deletion set of $G'_{j-1}$.
Then, by Lemma 14, $S_{j-1}$ is an $\mathcal{F}$-deletion set of $G_{j-1}$. Hence, by downward induction on $j$,
$S_1$ is an $\mathcal{F}$-deletion set of $G_1 = G$. □

Next we upper bound the size of the output solution $S$.

Lemma 15. Let $p$ be the number of rounds in which Algorithm 3 fails. Then the size of the
output solution $S$ is $|S| = OPT(G) + p$.

Proof. For every $x$, define $f_x$ to be the number of rounds $i \ge x$ such that the algorithm fails
in round $i$. Let $t$ be the number of rounds. We prove by downward induction on $i$ that
$|S_i| = OPT(G_i) + f_i$. Since $|S_t| = f_t = OPT(G_t) = 0$ this clearly holds for $t$. Consider
now some $i < t$ such that the equation holds for $i + 1$.

If the algorithm succeeded in round $i$ we have that $|S'_i| = |S_{i+1}| + 1$, that $OPT(G'_i) =
OPT(G_{i+1}) + 1$ and that $f_i = f_{i+1}$; hence $|S'_i| = |S_{i+1}| + 1 = OPT(G_{i+1}) + f_{i+1} + 1 =
OPT(G'_i) + f_i$. On the other hand if the algorithm fails in round $i$ we have $|S'_i| = |S_{i+1}| + 1$,
that $OPT(G'_i) = OPT(G_{i+1})$ and that $f_i = f_{i+1} + 1$. Then $|S'_i| = |S_{i+1}| + 1 = OPT(G_{i+1}) +
f_{i+1} + 1 = OPT(G'_i) + f_i$. Hence in both cases we have that $|S'_i| = OPT(G'_i) + f_i$. By
Lemma 14 we have that $|S_i| = |S'_i| + OPT(G_i) - OPT(G'_i) = OPT(G_i) + f_i$. This concludes
the proof. □

Now we lower bound the success probability of any round $i$ of Algorithm 3.

Lemma 16. There is a constant $p > 0$ such that the probability that Algorithm 3 succeeds
in any given round $i$ is at least $p$.

Proof. By Lemma 14 there is a constant $\rho$ such that with probability $1/2$, every $\mathcal{F}$-deletion
set of $G'_i$ is a $\rho$-cover. Let $S^*$ be an optimal $\mathcal{F}$-deletion set of $G'_i$. If $u_i \in S^*$ then $S^* \setminus u_i$
is an optimal $\mathcal{F}$-deletion set of $G'_i \setminus u_i = G_{i+1}$. So if $u_i \in S^*$ then the algorithm succeeds in
round $i$. If $S^*$ is a $\rho$-cover of $G'_i$ then the probability that $u_i \in S^*$ is at least $\rho$. Hence the
probability that every $\mathcal{F}$-deletion set of $G'_i$ is a $\rho$-cover and $u_i \in S^*$ is at least $p = \rho/2$. □

In each round Algorithm 3 succeeds with probability at least $p$. In a round $i$ where the
algorithm succeeds we have that $OPT(G_{i+1}) < OPT(G_i)$. Since the algorithm terminates
when $OPT(G_i) = 0$ we get the following observation.

Observation 4. There exists a constant $p > 0$ such that the expected number of rounds of a
run of Algorithm 3 is at most $\frac{1}{p} OPT(G)$.

Since the number of rounds where Algorithm 3 fails is at most the total number of
rounds it follows from Lemma 15 that the expected size of the output solution $|S|$ is at most
$OPT(G) + \frac{1}{p} OPT(G)$. This proves the following lemma.

Lemma 17. For every connected $\mathcal{F} \in \mathscr{L}$, Algorithm 3 runs in time $O(nm)$, expected
time $O((n + m)OPT(G))$ and outputs an $\mathcal{F}$-deletion set $S$ with $E[|S|] \le c \cdot OPT(G)$ for some
constant $c$.

While Lemma 17 only gives constant factor approximation algorithms for $\mathcal{F}$-Deletion
for connected $\mathcal{F} \in \mathscr{L}$, we can use this approximation algorithm to make an approximation
algorithm for all $\mathcal{F} \in \mathscr{L}$.

Theorem 10. For every $\mathcal{F} \in \mathscr{L}$, $\mathcal{F}$-Deletion has a constant factor approximation running
in time $O(nm)$ and expected time $O((n + m)OPT(G))$. It outputs a feasible solution $S$ with
expected size $c \cdot OPT(G)$ for a constant $c$.

Proof. By Proposition 1 for every $\mathcal{F} \in \mathscr{L}$ there is a constant $\eta$ such that for any $\mathcal{F}$-deletion
set $S$ of $G$ we have $\mathrm{tw}(G \setminus S) \le \eta$. Since Treewidth $\eta$-Deletion is an $\mathcal{F}'$-Deletion problem
for a connected $\mathcal{F}' \in \mathscr{L}$ it follows from Lemma 17 that it has a constant factor approximation with
the desired running time. We run this algorithm and find a set $S'$ such that $\mathrm{tw}(G \setminus S') \le \eta$.
We have that $E[|S'|] = O(OPT(G))$, where $OPT(G)$ refers to the size of the smallest $\mathcal{F}$-deletion
set in $G$. Since $\mathrm{tw}(G \setminus S') \le \eta$ we can solve $\mathcal{F}$-Deletion on $G \setminus S'$ in linear time
and find a set $S^*$ of size $OPT(G \setminus S') \le OPT(G)$. We return $S = S' \cup S^*$; $S$ is an $\mathcal{F}$-deletion
set of $G$ with expected size $O(OPT(G))$. □

Interestingly we can also use Algorithm 3 to give a fast randomized FPT algorithm for
$p$-$\mathcal{F}$-Deletion.

Theorem 11. For every connected $\mathcal{F} \in \mathscr{L}$, $p$-$\mathcal{F}$-Deletion has a randomized $O(c^k n)$ time
algorithm. Given a yes instance the algorithm finds a solution and outputs it with probability
$1/2$. If the algorithm outputs a solution, it is a feasible solution of size at most $k$.

Proof. We modify Algorithm 3 in the following way: if $G_{k+1}$ is not $\mathcal{F}$-free then output "no"
and halt. If the size of the output solution is more than $k$ then output "no" instead. The
algorithm runs for at most $k + 2$ rounds so the total running time is at most $O(nk)$. If it
outputs a solution $S$ then $S$ is an $\mathcal{F}$-deletion set of size at most $k$. We prove that if $G$ has an
$\mathcal{F}$-deletion set of size at most $k$ then the algorithm will output a solution with probability at least
$c^{k+2}$ for a constant $c$. Repeating this algorithm $O((1/c)^k)$ times and outputting a solution if
any iteration does then proves the theorem.

In each round, the probability that Algorithm 3 succeeds is at least $p$ for some constant
$p$. Thus the probability that Algorithm 3 succeeds in all its rounds before it terminates (after
at most $k + 2$ rounds) is at least $p^{k+2}$. If the algorithm succeeds in all rounds and outputs a
solution then this solution is optimal and hence has size at most $k$ if $(G, k)$ is a yes instance.
Finally, if the algorithm succeeds for $k + 2$ rounds then $OPT(G_1) > OPT(G_2) > \cdots > OPT(G_{k+2})$
and so $OPT(G_1) \ge k + 1$. Hence, if $(G, k)$ is a "yes" instance and Algorithm 3 succeeds
in all of its rounds then it will output a solution of size at most $k$ before terminating. This
concludes the proof. □
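
The repetition step of this proof can be made concrete with a short calculation. The sketch below assumes only what the proof states, namely that a single run succeeds with probability at least $q = p^{k+2}$ and that runs are independent; the function names are hypothetical. It uses $(1 - q)^N \le e^{-qN}$ to pick a number of repetitions $N$ that pushes the overall success probability to at least $1/2$.

```python
import math


def repetitions_for_half(p: float, k: int) -> int:
    """Number of independent runs after which at least one run succeeds
    with probability >= 1/2, given per-run success probability p**(k+2)."""
    q = p ** (k + 2)
    return math.ceil(math.log(2) / q)


def success_lower_bound(p: float, k: int, reps: int) -> float:
    """Lower bound 1 - (1 - q)**reps on the overall success probability."""
    q = p ** (k + 2)
    return 1.0 - (1.0 - q) ** reps
```

The returned count grows like $(1/p)^{k+2}$, which matches the $O((1/c)^k)$ repetitions in the proof up to a constant factor.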

7 Deterministic Parameterized Algorithms for $p$-$\mathcal{F}$-Deletion

We now give a deterministic $O(c^k n \log^2 n)$ time FPT algorithm for $p$-$\mathcal{F}$-Deletion for all
connected $\mathcal{F} \in \mathscr{L}$.

Lemma 18. For every connected $\mathcal{F} \in \mathscr{L}$, there exist constants $\rho, r, s, c < 1$ such that if we
run the DFPR with parameters $r, s$ on an instance $(G, k)$ such that $G$ has an $\mathcal{F}$-deletion set $S$
which is not a $\rho$-cover, then the output instance $(G', k')$ satisfies $|V(G)| - |V(G')| \ge \frac{c|V(G)|}{\log |V(G)|}$.

Proof. If $G$ has an $\mathcal{F}$-deletion set $S'$ which is not a $\rho$-cover, it also has an inclusion minimal
$\mathcal{F}$-deletion set $S$ which is not a $\rho$-cover. Such a minimal $S$ contains no isolated vertices and
hence satisfies $|N[S]| \le 2\sum_{v \in S} d(v) \le 2\rho m$.

By Proposition 1 there exists a constant $b$ such that $\mathrm{tw}(G \setminus S) \le b$. By Lemma 5, $G$
has a $(4(b+1)|N[S]|, 2(b+1))$-protrusion decomposition. Set $\beta = 2(b+1)$, $r = 3(\beta + 1)$ and
$r'$ to be the smallest integer such that the protrusion replacer will replace $r$-protrusions of
size at least $r'$. Set $s = 2^r \cdot r'$. The protrusion decomposition of $G$ is a $(4(b+1)|N[S]|, \beta)$-protrusion
decomposition. By Theorem 6 there exists a constant $0 < c < 1$ such
that if we run the DFPR on $G$ and $4(b+1)|N[S]| \le \frac{n}{250s}$ then the output graph $G'$ satisfies
$|V(G)| - |V(G')| \ge \frac{c|V(G)|}{\log |V(G)|}$. We show that there is a constant $\rho \le \frac{1}{3}$ such that if $S$ is not a
$\rho$-cover, then $|N[S]| \le \frac{n}{1000(b+1)s}$.

Since $\mathrm{tw}(G \setminus S) \le b$ we have that $G \setminus S$ is $(b+1)$-degenerate. If $S$ is not a $\rho$-cover then
$m \le n(b+1) + \sum_{v \in S} d(v) \le n(b+1) + 2\rho m$. Rearranging yields that $|N[S]| \le 2\rho m \le n\frac{2\rho(b+1)}{1 - 2\rho} \le
n\rho 6(b+1)$. Choosing $\rho = \frac{1}{6000(b+1)^2 s}$ yields that $|N[S]| \le \frac{n}{1000(b+1)s}$. Hence, if $S$ is not a
$\rho$-cover then the output instance $G'$ of the DFPR satisfies $|V(G)| - |V(G')| \ge \frac{c|V(G)|}{\log |V(G)|}$. □

Lemma 19. For every connected $\mathcal{F} \in \mathscr{L}$ there is an algorithm that given an instance $(G, k)$,
takes $O((n + m)\log^2 n)$ time and outputs an equivalent instance $(G', k')$ such that
$|V(G')| \le |V(G)|$ and $OPT(G') \le OPT(G)$. Furthermore there exists a constant $0 < \rho < 1$
such that every $\mathcal{F}$-deletion set $S'$ of $G'$ is a $\rho$-cover of $G'$.



Proof. By Lemma 18 there exist constants $\rho, r, s, c < 1$ such that if we run the DFPR
with parameters $r, s$ on an instance $(G, k)$ such that $G$ has an $\mathcal{F}$-deletion set $S$ which is not a
$\rho$-cover, then the output instance $(G', k')$ satisfies $|V(G)| - |V(G')| \ge \frac{c|V(G)|}{\log |V(G)|}$. We set these
constants as guaranteed by Lemma 18.

The algorithm sets $(G_1, k_1) := (G, k)$, $i = 1$ and enters a loop that proceeds as follows.
The algorithm runs the DFPR on $(G_i, k_i)$ with parameters $r$ and $s$; let the output of the
DFPR be $(G_{i+1}, k_{i+1})$. If $|V(G_i)| - |V(G_{i+1})| < \frac{c|V(G_i)|}{\log |V(G_i)|}$ the algorithm halts and outputs
$(G_i, k_i)$. Otherwise, the algorithm increments $i$ and returns to the beginning of the loop.

One iteration of the loop takes time $O((|V(G_i)| + |E(G_i)|) \log |V(G_i)|)$. Furthermore,
every $\log n$ consecutive iterations of the loop reduce the number of vertices by a constant
fraction. Hence the total running time is bounded by $O(n \log^2 n)$. Let $(G', k')$ be the
instance we output. By Lemma 18 we have that every $\mathcal{F}$-deletion set $S'$ of $G'$ is a $\rho$-cover
of $G'$. □



We will say that an instance $(G, k)$ is irreducible if running the algorithm of Lemma 19
on $(G, k)$ just outputs $(G, k)$ unchanged. Observe that if we run the algorithm of
Lemma 19 on an instance $(G, k)$, the instance $(G', k')$ output by the algorithm is
irreducible. A direct consequence of Lemma 19 is that in an irreducible instance $(G, k)$ every
$\mathcal{F}$-deletion set $S$ in $G$ is a $\rho$-cover.

We now give a deterministic algorithm for $p$-$\mathcal{F}$-Deletion, for connected $\mathcal{F} \in \mathscr{L}$. The
intuition behind this algorithm is that vertices of high degree seem more useful for a solution
than the vertices of low degree. Towards this we introduce the notion of buckets. We partition
the vertex set of $G$ into sets that we refer to as buckets, in the following fashion. For every
$j \ge 1$ define



Bj = {veV(G) \£<d{v)<£ 1 }. 



We set constants n > and d > such that 4rf "^ 3?? < p. For the presentation of the 
algorithm we fix an $\mathcal{F}$-deletion set $X$ of size at most $k$. Next we define a notion of big and
good for buckets.

Definition 4. A bucket $B_i$ is said to be big if $|B_i| > i\eta$ and it is said to be good if $|B_i \cap X| \ge
d|B_i|$.
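
A small sketch may help fix the bucket terminology. The bucket boundaries used here are an assumption of this example: vertex $v$ is placed in bucket $j$ when $n/2^j \le d(v) < n/2^{j-1}$ (dyadic degree classes), and a bucket $B_j$ is reported as big when $|B_j| > j\eta$. Function names and the dictionary-based input are hypothetical, and "good" is not computed because it depends on the unknown solution $X$.

```python
from typing import Dict, Hashable, List, Optional


def bucket_index(degree: int, n: int) -> Optional[int]:
    """Index j >= 1 with n / 2**j <= degree < n / 2**(j - 1).
    Assumed convention; isolated vertices get no bucket."""
    if degree <= 0:
        return None
    j = 1
    while degree * (1 << j) < n:
        j += 1
    return j


def buckets(degrees: Dict[Hashable, int]) -> Dict[int, List[Hashable]]:
    """Group vertices by bucket index; n is the number of vertices."""
    n = len(degrees)
    result: Dict[int, List[Hashable]] = {}
    for v, d in degrees.items():
        j = bucket_index(d, n)
        if j is not None:
            result.setdefault(j, []).append(v)
    return result


def big_buckets(bs: Dict[int, List[Hashable]], eta: float) -> List[int]:
    """Indices j of buckets with more than j * eta vertices."""
    return sorted(j for j, members in bs.items() if len(members) > j * eta)
```

With degrees bounded by $n - 1$, every non-isolated vertex lands in exactly one bucket with index at most $\lceil \log_2 n \rceil$, so there are only logarithmically many buckets to branch over.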

Algorithm-FPT-Det($G$, $k$)

Step 1: Check whether $G$ is $\mathcal{F}$-free; if yes then return(true). Else if $k \le 0$ and $G$ is
not $\mathcal{F}$-free return that $G$ does not have a $k$-sized $\mathcal{F}$-hitting set.

Step 2: Apply Lemma 19 on $(G, k)$ and obtain an equivalent irreducible instance
$(G^*, k^*)$.

Step 3: Let $B_j$, $j \in \{a, b, \ldots, \ell\}$, be the good buckets for $G^*$. For every good bucket
$B_j$, and for every subset $S \subseteq B_j$ of size at least $d|B_j|$ check whether
Algorithm-FPT-Det($G^* \setminus S$, $k - |S|$) returns true. If any of these calls return
true then return(true) else return(false).

Figure 4: A $2^{O(k)} n \log^2 n$ deterministic FPT algorithm for $p$-$\mathcal{F}$-Deletion.

The next lemma says that if $(G, k)$ is an irreducible yes instance to $p$-$\mathcal{F}$-Deletion then
it has a bucket that is both big and good simultaneously.

Lemma 20. For any connected $\mathcal{F} \in \mathscr{L}$, let $(G, k)$ be an irreducible yes instance to $p$-$\mathcal{F}$-Deletion.
Then $G$ has a bucket that is both big and good.

Proof. Since $(G, k)$ is an irreducible yes instance to $p$-$\mathcal{F}$-Deletion every optimal $\mathcal{F}$-hitting set
$X$ is a $\rho$-cover for $G$, that is, $\rho \sum_{v \in V(G)} d(v) \le \sum_{v \in X} d(v)$. For a contradiction, assume that
$G$ does not have a bucket that is both big and good.

log n 

v£X i=l veBillX 

E E d w+ E E d w 

{i\Biis not good} «6B,nX {i|Biis not big} veBitiX 

<d-Am+ ir >\ji) 
{i\Biis not big} 

Ad + 3n 

< d ■ Am + 3r/n = 2m < 2mp 

which contradicts that $X$ is a $\rho$-cover. □

Theorem 12. Let $\mathcal{F} \in \mathscr{L}$ be a connected obstruction set. There exists a deterministic
algorithm for $p$-$\mathcal{F}$-Deletion running in time $O(c_{\mathcal{F}}^k n \log^2 n)$ on an $n$-vertex graph. The constant
$c_{\mathcal{F}}$ only depends on $\mathcal{F}$.

Proof. The deterministic algorithm for $p$-$\mathcal{F}$-Deletion is described in detail in Figure 4.
Given a graph $G$, the algorithm essentially applies Lemma 19 to obtain $G^*$ and then recursively
tries to compute the solution to the problem by branching on all large subsets of all
the good buckets. The correctness follows directly from Lemma 20. Next we analyze the
running time of the algorithm. Suppose for the sake of analysis that all buckets are big, and
let $a_i$ be the size of bucket $i$. Then we have that

$$T(k) \le \sum_{i=1}^{\log n} 2^{a_i} \, T(k - d a_i).$$

Assuming $T(k) = x^k$, substitute recursively to get:

$$T(k) \le \sum_{i=1}^{\log n} 2^{a_i} x^{k - d a_i} = x^k \sum_{i=1}^{\log n} \left(\frac{2}{x^d}\right)^{a_i}.$$

If $\frac{2}{x^d} < 1$ then each term of the sum is maximized when the exponent is as small as
possible. We will choose $x$ (based on $d$) such that $\frac{2}{x^d} < 1$ holds. Since $a_i > \eta i$ for any big
bucket we have that

$$T(k) \le x^k \sum_{i=1}^{\log n} \left(\frac{2}{x^d}\right)^{\eta i}.$$

The sum above is a geometric series and converges to a value that is at most 1 for $x = c$,
for a suitably large choice of $c$ depending only on $d$ and $\eta$, which depend only on $\mathcal{F}$. This
bounds the running time by $c^k$. Further, if not all buckets are big the sum above should only
be done over the big buckets, yielding the same result. □
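
The choice of the base $c$ in this proof can be computed explicitly. The sketch below assumes the final sum has the form $\sum_{i \ge 1} (2/x^d)^{\eta i}$; writing $q = (2/x^d)^{\eta}$, the infinite series equals $q/(1-q)$, which is at most 1 exactly when $q \le 1/2$, that is when $x \ge 2^{(1 + 1/\eta)/d}$. Function names are hypothetical.

```python
def branching_base(d: float, eta: float) -> float:
    """Smallest x with sum_{i>=1} (2 / x**d)**(eta * i) <= 1,
    namely x = 2**((1 + 1/eta) / d)."""
    return 2.0 ** ((1.0 + 1.0 / eta) / d)


def series_sum(x: float, d: float, eta: float, terms: int) -> float:
    """Partial sum of the geometric series with ratio q = (2 / x**d)**eta."""
    q = (2.0 / x ** d) ** eta
    return sum(q ** i for i in range(1, terms + 1))
```

For instance, `branching_base(0.5, 1.0)` evaluates $2^{(1+1)/0.5} = 2^4$, and any larger base only makes the series smaller, so the bound $T(k) \le x^k$ is preserved.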



8 Conclusions and open problems 

The techniques developed in this paper have several interesting applications. Let us mention
a few that we find particularly interesting. Fomin et al. [27] give linear kernels for
bidimensional problems on apex-free and $H$-minor free graphs. The running time of their
kernelization algorithms is $O(n^h)$ where $h$ is a constant which depends on the considered
graph class. The reason the kernelization algorithms have this running time dependence is
that they employ naive protrusion replacement similarly to the implementation of Lemma 2.
If we use the randomized fast protrusion replacer instead we get linear kernels for all bidimensional
problems from [27] with linear time randomized kernelization algorithms. Using
the deterministic fast protrusion replacer yields linear kernels in time $O(n \log^2 n)$.

Most of the problems considered in [27] are CMSO-optimization problems that are strongly
monotone. For these problems we can use the lossless and fast protrusion replacers to
obtain linear kernels that are "lossless", in the following sense. If $G$ is the original instance
and $G'$ is the instance output by the kernelization algorithm, then any feasible
solution $S'$ to $G'$ can be lifted (in linear time) to a feasible solution $S$ of $G$ such that
$||S| - OPT(G)| \le ||S'| - OPT(G')|$. This makes the kernelization algorithms combine beautifully
with approximation algorithms and heuristics for these problems. For example,
combining such lossless kernels with approximation schemes yields $O(n + f(\epsilon) OPT^{O(1)})$
time approximation schemes for many problems on minor free graphs. When input instances
satisfy $OPT \ll n$ but $OPT$ is still too big to run parameterized algorithms, such approximation
schemes would be a viable option.

In the framework for obtaining EPTAS on $H$-minor-free graphs in [25], the running time
of the approximation algorithms for many problems is $f(1/\epsilon) \cdot n^{O(g(H))}$, where $g$ is some
function of $H$ only. The only bottleneck for improving the polynomial time dependence in
[25] is Lemma 4.1, which gives a constant factor approximation algorithm for Treewidth
$\eta$-Deletion or $\eta$-Transversal of running time $n^{O(g(H))}$. Instead of this algorithm we can
apply the algorithm from Theorem 10, which runs in time $O(n^2)$. This improves the EPTAS
from [25] to run in time $O(f(1/\epsilon) \cdot n^2)$. For the same reason, the PTAS for many problems
on unit disc and map graphs from [26] become EPTAS.

In a companion paper [23] we show that for every $\mathcal{F}$ in $\mathscr{L}$, $p$-$\mathcal{F}$-Deletion admits a
polynomial kernel computable in time $O(n^3 \cdot k^c)$ where $c$ is a constant depending only on $\mathcal{F}$.
This yields a deterministic algorithm for $p$-$\mathcal{F}$-Deletion with running time $O(2^{O(k \log k)} n)$
even for the families $\mathcal{F} \in \mathscr{L}$ that contain disconnected graphs.

An interesting direction for further research is to investigate $p$-$\mathcal{F}$-Deletion when none
of the graphs in $\mathcal{F}$ is planar. The most interesting case here is when $\mathcal{F} = \{K_5, K_{3,3}\}$, also known as
the Vertex Planarization problem. Surprisingly, we are not aware even of a single case
of $p$-$\mathcal{F}$-Deletion with $\mathcal{F}$ containing no planar graph admitting either a constant factor approximation,
polynomial kernelization, or a parameterized single-exponential time algorithm.
It is tempting to conjecture that the line of tractability is determined by whether $\mathcal{F}$ contains
a planar graph or not.

Acknowledgements. The authors are grateful to Bart Jansen for insightful discussions,
and especially for pointing out that $p$-$\mathcal{F}$-Deletion does not have finite integer index when
the family $\mathcal{F}$ is not connected.